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DECLARATION UNDER 37 C.F.R. 1 .1.12 
I, Robyn Russell of Canberra, Australia declare that: 

1. I am one of the inventors of the subject matter of U.S. Patent Application Serial No. 
09/776,910 (the present application) which is a divisional of U.S. Patent Application Serial 
No. 09/068,960 (now U.S. 6,235,515) originaUy filed as PCT/AU96/00746 filed on 22 
November 1996 which claims priority firom Australian Provisional Application No. PN 6751 
filed on 23 November 1995. 

2. My qualifications and technical experience are set out in my Curriculum Vitae, a copy of 
-which is attached as Annexure A. 

3. The present plication relates to the isolation and characterisation of esterases with the 
ability to hydrolyse organophosphates such as malafliion. 

4. It is my understanding that the following claim is pending in the present application: 

P. A recombinant enzyme capable of hydrolyzing at least one organophosphate 
selected from the group consisting of carboxylester organophosphates and 
dimethyl-oxon organophosphates, wherein the recombinant enzyme comprises 
an amino acid sequence which is at least about 75% identical SEQ ID NO:8. 
wherein the recombinant enzyme comprises amino acid residue conserved 
between the sequences provided in Figure 4 with the exception that the 
recombinant enzyme comprises an amino acid selected from the group 
consisting of Leu. Ser. Ala. J7e. Vol. Thr, Cys. Met and Gly at position 251. 

5. As outlined in the present ^plication, myself and co-inventois endeavoured to identify an 
enzyme which confers organophosphate resistance \spon the Dipteran Z«ci/za cuprina. 
Following a number of failed attempts, myself and co-inventors isolated and characterised a 
molecule which was found to be a mutant of an akeady known esterase referred to in the art 
as ••E3". 

6. Since the present ^plication was filed myself and co-workers have made a further protein 
encompassed by pending claim 9. Li this protein, the naturally occurring serine at position 
250 has been replaced with a proline. This mutant possesses organophosphate hydrolvsine 
activity (see Table below). 



Enzyme 


Titre 

(pmol jj"*) 


dECP 
A:cat (min"*) 


dMUPAct 
(min*) 


E3W251L 


6.9 ±2.0 


0.0092 ± 0.0004 


0.0610 ±0.0060 


E3P250S/W251L 


6.7 ±0.80 


0.0036 ± 0.0003 


0.0180 ±0.0000 
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7. My laboratory has also made a mutant of the protein fiiom Drosophila melanogaster which is 
orthologous to the Lucilia cuprina and Musca domestica proteins disclosed in the present 
q>plication. The mutated version of the D, melanogaster protein is 63% identical to the 
clainaed L cuprina molecule, and 76% identical to the M domestica protein, and maintains 
the biological activity defined in pending claim 9. Alignments of these proteins are provided 
in Annexure B. A comparison of the biological activity is provided in the following Table. 



Enzyme 


Titre 

(pmol pi'') 


dECP 
Acat (mm'*) 


(min-*) 


E3W251L 


6.9 ±2.0 


0.0092 ± 0.0004 


0.0610 ±0.0060 


EST23W251L* 


0.9 


0.0060 ±0.0004 


0.0210 ± 0.0003 



* EST23 is the designation of the D. melanogaster ortholog of i. cuprina E3 



8. My laboratory has made many other mutants of the protem provided as SEQ E) NO: 8 in the 
present application. I do not recall producing any mutants which are encompassed by 
pending claim 9 which lacked organophosphate hydrolysing activity. 

9. As outlined herein, the proteins described in the present ^plication are esterases. Esterases 
form a large protein family with multiple esterases with varying activity being foimd in many 
different organisms such as animals (including insects and mammals), plants, fungi and 
bacteria. 

10. At the earliest priority date, namely 23 November 1995, it is my opinion that the 
structure/fimction relationship of various esterases was well understood. In particular, 
important functional domains had been identified, and the skilled person was well aware of 
important conserved residues. 

1 1 . With regard to Dipterans, our understanding of different esterases in this faiAily of organisms 
was also well advanced at 23 November 1995. For instance, a cluster of esterases had aheady 
been identified and characterised firom another Dipteran, namely Drosophila melanogaster, 

A copy of publications describing this large family of esterases, and structural/functional 
characteristics thereof) is attached as Annexure C (namely, Oakeshott, J.G., van Papenrecht, 
E.A., Boyce, TM., Healy, M.J. and Russell, R.J. (1993) Evolutionary genetics of Drosophila 
esterases. Genetica 90: 239-268, and Kaiotam, J., Delves, A.C. and Oakeshott, JG. (1993) 
Conservation and change in structural and 5* flanking sequences of esterase 6 in sibling 
jDro5cpAz7a sequences. Genetica 88: 11-28). 



Dated this. 2005 



RobKsissell 



ALIGNMEN T 0¥ LUCILIA CUPRINA. MUSCA DOMESTICA AND 
DROSOPHILA MELANOGASTER PROTEINS 



MNFWSLMEKLKWKIKCIENKPLNYRLTTNET^ 6 0 

MNFKVSQMEIOJSWKLKCM\mKYTtm^STNETQIIDTEYGQIKGVKRM^^ 60 

Dm MNKNIXSFVERLRWRLKTIEHKVQQYRQSTNETWADTEYGQVRG^ 6 0 

** ::, :*:* *::* : :* ;** . :****;::*:**:;:** ,*.***^ 

IPYAQPPVGELRFKAPQRPTPWDGVRDCCNHKDKSVQVDFITGKVCGSEDCLYLS\^^ 12 0 

IPYAKPWGELRFKAPQRPVPWEGTODCCGPANRSVQTDFISGKPOXSSEDCLYLN^ 120 

Dm IPYAQPPVGELRFKAPQRPIPWERVRDCSQPKDKAVQVQFVFDKVEGSEDCLYIJN^ 12 0 

LNPETKRFVLWIHGGGFIIGENHRDMYGPDYFIKKDVVLINIQYRLGALGFLSLNS^ 180 

I^PDKKRPVMVFIHGGDFIFGEANRNWFGPDYFMKKPV\^VTVQYRI^^ 180 

Dm VKPDKARPVMVWIHGGGFIIGEANREWYGPDYFMKEDVVLVTIQYRLGALGFMSL^^ 180 

NVPGNAGLKDQVMALRWIKNNCANFGGNPDNITVFGESAGAASTHYMMLTEQTR 240 

NVPGNAGLKDQVMALRWVKSNIANFGGDVDNITVFGESAGGASTHYMMIT 240 

Dm NVPGNAGLIOJQVIJUiK&aKNNCASFGGDPNCITVFGESAGGASTHY^ 240 

I^^SGNAICPIANT-QCQHRAFTIAKIAGYKGEDNDK^ 299 

IMMSGNSMCSIAST-ECQSRALTMAKRVGYKGEDNEKDILEFLMKANPYDLIKEEPQV^^ 299 

L ILQSGSAICPLAYNGDITHNPYRIAKLVGYKGEDNDKDVLEFLQNVKAKDLIRVEEN^ 300 

*: **.::*.** . : :** ,*******.★*.*★*★ . ***. * .*** 

LEERTNKVMFPFGPTVEPYQTADCVLPKHPREIWKTAWGNSIPTMMGOT 359 

PEEMQNKVMFPFGPTVEPYQTADCWPKPIREIWKSAWGNSIPTLIGNTSYEGLLFKSIA 359 

LEERMNKIl^AFGPSLEPFSTPECVISKPPKEMMKTAWSNSIPMFIGNTSYEGLLWVPEV 360 
** **:**.***::★*:.*.-**.,* .**.*.★***** ..********.. 

KQMPMLVKELETCVNFVPSELADAERTAPETLEMGAKIKKAHVTGETPTADOT 419 

KQYPEVVKELESCVNYVPVffiUUDSERSAPETLERAAI VKKAHVI^ 419 

KLMPQVLQQLDAGTPFIPKELLATEPSKEKLDSWSAQIRDVHRTGSESTPDNYMDLCSIY 42 0 



* *::::*'• ..*** .* 



YFWFP14HRLLQLRFNHTSGTPVYLYRFDPDSEDLINPYRIMRSGRGVKG7SHADELTYFF 479 

YFLFPMHRFIK3LRFNHTAGTPIYLYRFDFDSEEIINPYRIMRFGRGVKGVSHADELT^ 479 

YFVFPALRVVHSRHAYAAGAPVYFYRYDFDSEELIFPYRIMRIiGRGVKGVSHADDLSYQF 480 
** ** *. :::*:*:*:**:*****::* ****** ***********.*.* ★ 

WNQLAKRMPKESREYKTIERMTGIWIQFATTGNPYSNEIEGMEWSWDPIKKSDEV^ 539 

WNILSKRLPKESREYKTIERMVGIWTEFATTGKPYSNDIAGMENLTWDPIKKSDDVYKCL 539 

SSIilJUUlLPKESREYRNIERWGIWTQFAATGOTYSEKINGMDTLTIDPVRKSDW^ 540 



NISDELKMIDVPEMDKIKQWESMFEKHRDLF- 
NIGDELKVMDLPEMDKIKQWASIFDKKKELF- 
NISDDLKFIDLPEWPKLKVWESLYDDNKDLLF 



570 
570 
572 



ALIGNMENT OF LUCILIA CUPRINA AND DROSOPHILA 
MELANOGASTER PROTEINS 



LC MNFWSLMEKLKWKIKCIEl^I^mjTTNEa^ 60 

Dm MNKNLGFVERLRWRLKTIEHKVQQYRQSTl^TWAOT 60 
** *:.::*:*:*::* **:*. :** :*******:****:*:*.***..>* ^*,**** 

Lc IPYAQPFVGELRFKAPQRPTPWDGVRDCCNHKDKSVQVDFITGKVCGSEIX:LYLS^^^ 120 

Dm IPYAQPPVGELRFKAPQRPIPWERVRDCSQPKDKAVQVQFWDK\^GSEDCLYLNVYTm 120 
******************* **, **** . ***,***.★. ************* 

X^ETKRPVLVYIHGGGFIIGEmGU^MYGPDYFIKKDVVLINIQYRL^^ 180 

Dm VKPDKARPVMVWIHGGGFI IGEANREWYGPDYFMKEDVVLVTIQYRIX^^ 180 
,.*._ ***.*.********** .*. ******.*.****. **********.**.* ,it 

NVPGNAGLKDQVl^RWIKNNCAOTGGNPDNITVFGESAGAASTHYMMLTEQra 240 

Dm WPGNAGLKDQVIJUJKWIKmJCASFGGDPNCITVFGESAGGASTHYMMLTDQ^ 240 
************.**.**********.*, *********^*****^t***,^^,^^^^^^ 

ILMSGNAICPIAm'-QCQHRAFTIAKLAGYKGEDl^KDVLEFI^^ 299 

Dm ILQSGSAICPIAYNGDITHNPYRIAKLVGYKGEDNDKDVLEFLQNVKAKDLIRVEENVLT 300 
** ******** ^ . *,.: .*** ^ ************** * :.*.:***::**;*** 

LEERTNKVMFPFGPT\^PYQTADCVLPKHPREM\^TAWGNSIPTMMGNTSYEGLFFTSIL 359 

Dm LEERMl^IMFAFGPSLEPFSTPECVISKPPKEMMKTAWSNSIPMFIGNTSYEGLLWVPEV 360 

**** **:**.*** J ;**; ^ * ^ .**._ * *-**.******** ..***♦****.. 

LC KQMPMLVKELETCVNFVPSELADAERTAPETLEMGAKIKKAHVTGETPTADNFMDI^ 419 

Dm KLMPQVLQQLDAGTPFIPKELI^TEPSKEKLDSWSAQIRDfVHRTGSESTPDNYK^ 420 
* ** ::::*:: . *.*^** ,* . . ^ *.*. * **^ _*^**-***** 

YFWFPMHRLLQLRFNHTSGTPVYLYRFDFDSEDLINPYRIMRSGRGVKGVSHADELT^ 479 

Dm YFVFPALRWHSRHAYAAGAPVYFYRYDFDSEELIFPYRIMRLGRGVKGVSHADDLSYQF 480 

** ** *. -;;*.***.**.*****.** ****** ***********.*.* * 

Lc VmQLAKRMPKESREYKTIERMTGIWIQFATTGNPYSNEIEGI^^ 539 

Dm SSLIiARRLPKESREYRNIERWGIWTQFAATGNPYSEKIN(3M^ 540 
_ **.*.*******. *** _*** ***.******,.*,**.,. **..***** i^ifi, 

Lc NISDELKMIDVPEMDKIKQWESMFEKHRDLF- 570 

Dm NISDDLKFIDLPEWPKLKVWESLYDDNKDLLF 572 

****,**,**.** *.* *** 
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Abstract 

Over 30 carboxylester hydrolases have been identified in D. melanogaster. Most are classified as acetyl, 
carboxyl or cholinesterases. Sequence similarities among most of the carboxyl and all die cholinesterases so 
far characterised from D. meUinogaster and other eukaryotes justify recognition of a carboxyl/cholinesterase 
multigene family. This family shows minimal sequence similarities with other esterases but ciystallographic 
data for a few non-drosophiUd enzymes show that the family shares a distinctive overall structure with some 
other carboxyl and axyl esterases, so they are aU put in one superfamily of /p hydrolases. Fifteen esterase 
genes have been mapped in D. melanogaster and twelve are clustered at two chromosomal sites. The 
constitution of each cluster varies across Drosophila species but two carboxyl esterases in one cluster are 
sufficientiy conserved tiiat tiieir homologues can be identified among enzymes conferring insecticide resis- 
tance m otiier Dipteia. Sequence differences between two otiier esterases, the EST6 carboxyl esterase and 
acetylcholinesterase, have been inteipreted against the consensus super-secondary structure for the carboxyl/ 
cholinesterase multigene family; tiieir sequence differences are widely dispersed across the stmcture and 
mclude substantial divergence in substrate binding sites and tiie active site gorge. This also applies when 
EST6 is compared across species where differences in its expression indicate a difference in function. 
However, comparisons witiiin and among species where EST6 expression is conserved show tiiat many 
aspects of die predicted super-secondary structure are tightly conserved. Two notable exceptions are a pair 
of polymorphisms in tiie substrate binding site of tiie enzyme in Z>. melanogaster. These polymorphisms are 
associated witii differences in substrate interactions in vitro and demographic data indicate tiiat die alternative 
forms are not selectively equivalent in vivo. 



Introduction 

It has been a cornerstone of evolutionary theory 
that the diversity of biochemical functions required 
in complex multicellular organisms has been 
achieved by the evolution of multigene families. 
Two types of component process are proposed. 
First, there must be an amplification of gene copy 
number through a set of duplication events. This, in 
turn, allows some of the duplicated genes to diverge 
from the ancestral gene and adopt new functions, 
without compromising the ability of other members 
of the duplicated family to carry out die ancestral 
function. The first process, the amplification of 
copy number, has been exemplified many times, 



through both classical genetic and molecular evi- 
dence for clusters of closely related genes. How- 
ever, the subsequent process of functional diver- 
gence has proven less tractable because it requires 
detailed biochemical, genetic and molecular analy- 
ses. 

One set of gene/enzyme systems that is now 
proving amenable to all three approaches is a sub- 
group of esterase enzymes that preferentially hy- 
drolyse esters of carboxylic acids (E.C. 3. 1 . 1). As is 
common usage we simply term this subgroup the 
esterases. It excludes a multitude of phospho- and 
thio-esterases, phosphatases and sulphatases but 
even so encompasses several multigene families. 
Much of the work to date has focussed on verte- 
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brate and to a lesser extent microbial esterases; 
sequence data are now available for many of these 
and crystal structures have also been solved for a 
few. However, a second strand of work has been 
exploiting the unique advantages of Z>. melanogas- 
ter to probe genetic and molecular aspects of the 
esterases tiiat would be intractable with other 
higher organisms. The following review focusses 
on this second strand of work, but also seeks to 
reinterpret many of the findings in the light of the 
sequence-structure-function models developed 
from the crystal structures of the related enzymes in 
the first strand. 



Functional dassUQcation of esterases 

Mammalian and microbial precedents 
While the substrate specificities of some esterases 
may be quite narrow in viVo, many hydrolyse a 
broad and overlapping range of substrates in vitro. 
One advantage of this has been that many can be 
assayed electrophoretically, by coupling their hy- 
drolysis of various synthetic naphthyl esters to the 
colorimetric conversion of certain dyes. Over 20 
esterase isozymes have been resolved by these 
means in each of several vertebrate species (e.g., 
Hohnes & Masters, 1967; Coates, Mestriner & 
Hopkinson, 1975). However, the disadvantage of 
these broad substrate ranges is that other criteria are 
also required to achieve a functional classification 
of esterase activities. The classification most 
widely used relies primarily on sensitivities to diag- 
nostic concentrations of three groups of inhibitors, 
namely sulfhydryl reagents (typically p-chloromer- 
curibenzoic acid, or pCMB), organophosphates 
(OPs, such as paraoxon, fenitrooxon and diisopro- 
pyl fluorophosphate, or DFP) and eserine sulfate 
(Holmes & Masters, 1967). Four classes of en- 
zymes arc discriminated on these criteria: 

Acetyl esterases, which are not affected by any of 
the inhibitors and generally prefer aliphatic sub- 
strates involving acetic acid. 

Aryl esterases, which are only inhibited by the 
sulfhydryl reagents and generally prefer aromatic 
substrates. 

Carboxyl esterases, which are only inhibited by 
the OPs and prefer aliphatic esters, generally of 
longer acids than acetic acid, and 

Cholinesterases, which are inhibited by both 



OPs and eserine sulfate and prefer charged sub- 
strates like cholinesters over other aromatic or aU- 
phatic esters. 

At least half the esterases detectable by system- 
atic electrophoretic surveys of mamtn^tjaT^ species 
prove to be carboxyl esterases, most of the remain- 
der comprising similar numbers of the other three 
classes (Ecobichon & Kalow, 1965; Holmes & 
Masters, 1967). However, a few mammalian 
esterases have now been described that do not fit 
readily into any of these four classes. One example 
from humans is butyrylesterase, which is inhibited 
by pCMB and OPs but not eserine sulfate (Hjoring 
& Svensmark, 1988). 

Some but not all aspects of this mammalian- 
based classification system appear to hold for 
esterases in organisms as distantly related as 
prokaryotes. Enzymes classifiable as aryl and ace- 
tyl esterases agamst the mhibitor criteria have been 
described from bacteria (e.g., Choi et al,, 1990; 
Luthi et aL, 1990). However, we are unaware of 
any bacterial enzymes that would be classified as 
carboxyl or cholinesterases on these criteria. In 
fact, many bacterial esterases fall outside all four 
inhibitor-based classes (e.g., Raymer, Willard & 
Schottel, 1990, and references therein). 

Drosophila and other insects 

Twenty-two soluble esterase isozymes have been 
detected by native polyacrylamide gel electro- 
phoresis (PAGE) of individual tissues and defined 
life stages of D. melanogaster (Healy, Dimiancic & 
Oakeshott, 1991). At least seven more can be re- 
solved if a second dimension of electrophoresis 
involvmg isoelectric focussing is applied (Healy, 
Dumancic & Oakeshott, 1991; Campbell, Healy & 
Oakeshott, 1992). Most of these esterases are prob- 
ably encoded by different genes. The only excep- 
tions found so far are four isozymes of ace- 
tylcholinesterase (AChE) that are all encoded by 
the one gene (Ace\ Arpagaus, Foumier & Toutant, 
1988, and see below). All five non-AChE isozymes 
which have been mapped genetically are inherited 
as single and distinct genes (see below) and this 
accords well with mammalian precedents. In 
mouse, for example, over 30 different non-AChE 
isozymes have been described (Ruddle & Harring- 
ton, 1967) and so far 23 have been shown to be 
products of. different genes (O'Brien, 1990; Von 
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Deimling & Wassmer, 1991; Von Deimling & Gaa, 
1992). 

The electrophoretic survey of Healy, Dumancic 
and Oakeshott (1991) would not have detected 
some esterases in D. melanogaster that either can- 
not hydrolyse any of the naphthyl esters used (e.g., 
most lipases and malathion carboxyl esterase, or 
MCE; Smyth et al, in preparation), or are tightly 
associated with membranes and require detergent 
treatment to solubilise {e.g„ EST23; Spackman et 
aL, 1993). Add these to the 29 soluble enzymes 
detected by electrophoresis or isoelectric focussing 
and it seems likely that there will be well over 40 
esterases in Z). melanogaster. 

Inhibitor analyses of the 22 soluble isozymes 
detected by native PAGE plus the membrane- 
bound EST23 reveals eleven carboxyl esterases, six 
cholindsterases and three acetyl esterases, with only 
three isozymes not clearly classifiable (Table 1). 
Moreover, the carboxyl and cholinesterases can 
eacli be divided further into subclasses on the basis 
of iohibition by OPs and pCMB. The five subclass 
I carboxyl esterases are qualitatively more sensitive 
to inhibition by OPs than the six subclass U 
isozymes, while the foxir subclass I cholinesterases 
(the four AChE isozymes) are much less sensitive 
to pCMB inhibition than are the two subclass n 
enzymes. The fact that no aryl esterases have yet 
been recovered from D. melanogaster does not nec- 
essarily indicate their absence in this species, since 
most of the mammalian aryl esterases characterised 
by Holmes and Masters (1967) have relatively high 
electrophoretic mobility and relatively low stability 
under heat or urea treatment If the aryl esterases of 
D. melanogaster behave in a similar way they 
would probably not have been detected under the 
conditions used to siurey this species (Healy, 
Ehunancic & Oakeshott, 1991, and references 
therein). Two carboxylesterases (esterase 6, or 
EST6, and juvenile hormone esterase, or JHE; 
White, Mane & Richmond, 1988; Campbell, Healy 
& Oakeshott, 1992) and two cholinesterases (AChE 
and EST9; Foumier et al, 1988; Morton & Smgh, 
1985) have been substantially purified from 
Z). melanogaster and a few others have been simi- 
larly characterised from other drosophilid (see be- 
low) and non-drosophilid insects (e.g., Devonshire, 
1977; Kao, Motoyama & Dauterman, 1985; Ziegler 
etal, 1987; Mouches etal, 1987; Abdel-Aal etal, 
1988; Field et aL, 1993). Like theif mammalian 



Table 1, Inhibitor classification of 23 esterase isozymes from 
D. melanogaster {ftomHeaiy, Dumancic & Oakeshott, 1991 and 
Spackman « a/., 1993). 



Innibitor-basea class 


Isozyme 


Carboxyl esterase 




Subclass 1 


EST12; EST15; EST16; EST18; 




EST23 


Subclass 2 


ESTl; EST2; EST6; EST14; EST17; 




EST22 


Choiinesterase 




Subclass 1 


EST4; EST5; EST8; EST13* 


Subclass 2 


EST9; ESTIO 


Acetyl esterases 


ESTI9; EST20; EST21 


Miscellaneous 


EST3;EST7; ESTll 



* All four subclass I cholinesterases are isozymes of AChE. 



counterparts, almost ail have a subunit size between 
60 and 70 KDa, with a single active site per subunit. 
The only exception is a termite carboxyl esterase, 
which only has a subunit size of 40 KDa (Sreerama 
& Veerabhadrappa, 1991). The only other esterases 
as small as this are several aryl esterases and some 
others that cannot be classified against the inhibitor 
criteria (Ganef a/., 1991; OUis etal, 1992; Cygler, 
Schrag & Ergan, 1992; Cygler et al, 1993). No 
insect aryl or acetyl esterase has yet been purified. 

Esterase gene fsimlMes 

The a/p hydrolase superfamily 
Over 50 esterase genes have now been cloned, 
mainly from vertebrates and bacteria but also sev- 
eral from fungi and insects. Most of the bacterial 
enzymes and a few eukaryotic esterases that caimot 
be classified on the inhibitor criteria show no re- 
cognisable sequence similarity with the other sys- 
tems (Ounissi & Courvalin, 1985; Markovic & 
Jomvall, 1986; Ray etal, 1988; Choi etal, 1990; 
Raymer, Willard 4& Schottel, 1990; Zschunke et aL, 
1991), suggesting that enzymes with esterase activ- 
ity may have multiple origins. However, most 
eukaryotic carboxyl and cholinesterases, including 
all the esterases so far cloned from insects and a few 
bacterial esterases that are either aryl esterases or 
imclassifiable on the inhibitor criteria, do show at 
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least some sequence similarity (OUis et al, 1992; 
Cygler, Schrag & Ergan, 1992; Cygler et al, 1993; 
see also Okada & Wakabayashi, 1988; Farrell etal, 
1990; Van der Meer et al, 1991). On this basis, a 
small proportion of the bacterial esterases but a 
substantial proportion of cukaryotic esterases 
would seem to have evolved from just one of the 
ancient lineages. For reasons that we now elaborate 
this lineage is called the ce/3 hydrolase superfamily. 

Several enzymes m this lineage, including bacte- 
rial aryl esterases and eukaryotic carboxyl and 
cholinesterases, have been crystallised and their 
tertiary structures resolved (Cygler, Schrag & Er- 
gan, 1992; OUis et al., 1992; Van Tilbeurgh et aL, 
1993, for references), enabling sequence similari- 
ties to be related to structural similarities. Two 
diajgnostic features of the primary sequence that 
contribute to the active site have been found to 
recm- across these enzymes. Moreover, sequence 
comparisons and in vitro mutagenesis indicate that 
these features also occur in other members of the 
lineage that have not yet been crystallised (e.g,, Di 
Peisio, Fontame & Hui, 1990, 1991; Di Persio & 
Hui, 1993, and references therein). No insect est- 
erase has yet been crystallised but the sequences of 
all those whose genes have been cloned contain 
these feattires (Cygler al., 1993). 



The first feature is a triad of non-contiguous 
residues, generally Ser-Asp-His, otherwise Ser- 
Glu-His or Clys-Asp-His (ordered according to their 
occurence m the primary sequence). This triad 
forms a charge relay that executes the hydrolytic 
reaction by donating a proton to the ester bond of 
the substrate. Significantly the same residues are 
involved in the catalytic triad found in many pro- 
teases, but the order of the, residues in the primary 
sequence of all the esterases in this lineage diffas 
ftom the proteases. Hie only esterases Imown to 
have a different order for their triad are the rodent 
*granzymes' and their human 'serine esterase' 
homologues, which show a trypsin-like order for 
their catalytic triad and are generally placed in the 
serine protease family (Zschunke et ai, 1991, and 
references therein). Of the three alternatives found 
in the esterases in the a/3 hydrolase superfamily, 
the Cys-Asp-His is found in the (bacterial) aryl 
esterases sequences, Ser-Glu-His in the eukaryotic 
cholinesterases and some of the eukaryotic car- 
boxyl esterases, and Ser-Asp-His in a minority of 
the eukaryotic carboxyl esterases. This explains in 
part why OPs (which bind Ser) do not inhibit aryl 
esterases but do inhibit the others. 

The second feature of primary sequence shared 
by esterases in the aj^ hydrolase lineage is the 




Po Pi Pa Ps P6 P7 Pa 



Fig 1, Schematic diagram of the o/p hydrolase fold which Cygler et al (1993) propose is shared by aU members of the carboxyl/ 
cholmesterase multigene family, p strands are shown as arrows and a heUces are boxed. Following Cygler et al (1993) p strands that 
ccmtnbute to the small and large b sheets are denoted p or p respectively, with their subscripted numbers indicating their order in the 
primary sequence. Subscripts for the a helices show the p strands they separate. The loops contributing to die catalytic triad are also 
shown. Coordmates for all structures in the primary sequences of AChE and EST6 are given in T^ble 1. 
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lcx:alisation of the Ser/Cys nucleophile of the cata- 
lytic triad in a highly conserved Gly-x-Ser/Cys-x- 
Gly pentapeptide. This motif is critical for the 
placement of the Ser/Cys at the apex of an ex- 
tremely sharp turn and gives it ready access to the ' 
His of the triad on one side and the substrate on the 
othet This nucleophile pentapeptide, like the triad 
above, is also found in some protease families but, 
again excepting the granzyme/serine esterase 
group, its tertiary structure context is completely 
different, suggesting independent evolutionary ori- 
gins. 

Many of the esterases in the a/3 hydrolase line- 
age show litde sequence similarity witii one another 
outside their catalytic triad and nucleophile pen- 
tapeptide. Yet the secondary and tertiary structures 
of five members of the group whose higher order 
structures have been resolved show substantial sim- 
ilarities with each other (Ollis et al, 1992). Al- 
though their subunit sizes vary from about 25 to 60 
KDa, they all show an internal bacldx)ne called the 
major p sheet, which comprises at least eight 3 
strands. Wth a few exceptions, these strands show 
the same orientation with respect to each other and 
occur in the same order in the primary sequence. A 
minor p sheet of three strands and unknown func- 
tion also occurs towards the amino terminus of the 
eukaryotic enzymes. Loops between the strands of 
the major p sheet include variable numbers of a 
helices and, amongst other functions, provide pro- 
tection and stability to the major p sheet The loops 
at the carboxy termini of (generally) strands p^, p^ 
and pg in the major p sheet juxtapose the triad 
residues, while excursions of variable size, mainly 
after strands % and p^, and before strand p^, 
provide much of the substmte binding structure for 
the active site (see Fig. 1 and Tbble 2). Ollis et al, 
(1992) term this overall stracture the a/p hydrolase 
fold and they argue that the extent of its conserva- 
tion, even among enzymes with minimal primary 
sequence similarity, must reflect common, albeit 
ancient, ancestry. 

The carboxyl/cholinesterase muUigene family 

Some members of the a/p hydrolase superfamily 
have non-esterase hydrolytic activities. However 
most of those characterised to date are esterases and 
these have been partitioned into six major families 
(Cygler. Schrag & Ergan, 1992). Between families 



Tiible 2. The empirical relationship between primary and secon- 
dary structures for AChE from T caUfomia, and the predicted 
relationship between these strucmrcs for EST6 from D, melan- 
ogaster. The AChE relationship and the nomenclature for se- 
condary structures are taken from Sander and Schneider (1991) 
and Cygler et al (1993). The EST6 relationship is based on the 
model of Cygler et aL (1993) and the alignment of EAvP. See 
also Figure 1 fOr a graphical representation of the secondary and 
supersecondary structures. 



Primary sequence 



AChE 



418-423 
444-447 
46(M79 
501-505 
512-514 
518-532 



EST6 



Secondary structure 



Type 



Name 




* These helices are present in a closely related lipase from the 
fungus Geotrichum candidum, but absent in both AChE and 
EST6. 

t a*7 8 in EST6 is interrupted by a 10 residue in- 
sertion. 
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48% 



38% 



48% 



77% 



69% 



100% 



-Esterase B1, Culexpipfens 

•Juvenile hormone esterase. Heliothis virescens 

- Esterase 6, Drosophiia meianogaster 

• Upase 1 . Geotrfchum eandidum 



— Sterol esterase, Ho^o sapiens 

— L»ver cartx>xylesterase. Homo sapiens 



-Aoetytehoflnesterase, Homo sapiens 
-Butyryfchollnesterase, Homo sapiens 
-Acetylcholinesterase. Drosophiia melanogaster 
-Thyrogtobulin. Homo sapiens 



Fig. 2. Stnct consensus networkfor the infened amino add ^ 

Sequences w^ahgned as for Tables 1 and 2. Distances among sequences were calculated using the PAM distance mattix (Dayhoff 
S<liwart2 & Oicutt. 1978) on one hundred randomly re-sampled (bootstrapped) lepUcates and networks were constructed using lii^ 
neighbor jommg method (Saitou&Nei, 1987) a? implemented in PHYLIP version 3.4 (Felsenstein. 1991). The network is tooted with 
tiie human thyroglobulm sequence. Numbers to the left of each node fedicatethepetcentageofttq)Hcatesinwhichthatnodewasformed 
Unc lengths are not proportional to distances. 



primary sequence similarity is largely confined to 
the catalytic triad and nucleophile pentapeptide, but 
within families sequence similarities are much 
higher. One family contains several bacterial aiyl 
esterases, the dienelactonc hydrolases (Van der 
Meet et al, 1991; Ollis et ai, 1992). Hien there are 
four major families of lipases, two so far confmed 
to bacteria and vertebrates, respectively, and two 
that include both fungal and vertebrate enzymes 
(plus at least three smaller families of lipases and 
cutinases). Finally, the largest and best character- 
ised family includes some fungal lipases, some 
other eukaryotic carboxyl esterases and all the 
cholinesterases so far sequenced (Clygler, Schrag & 
Ergan, 1992, Cygler et al, 1993). We tenn this 
latter family the carboxyl/cholinesterase multigene 
family. It covers all the insect esterases so far se- 
quenced. 

Sequences have been published for just over 30 
carboxyl/cholinesterases. Seven are orthologous 
AChEs from different species, but most of the oth- 
ers are clearly paralogous. Across the whole family 



the average pairwise sequence identity is 29% and 
this figure rises to 36% for a stretch of just over 300 
residues which lies in the amino terminal part of 
most of the proteins (see below for three excep- 
tions). Higher order structures are known for two of 
the enzymes, a fungal lipase (Schrag & Cygler, 
1993) and a vertebrate AChE (Sussman et al., 
1991; Rippol et al, 1993). Although the functions 
and sequences of these two enzymes make them 
quite distant relatives within the family (see Fig. 2), 
they nevertheless show remarkable similarity in 
higher order structure. It has therefore been possi- 
ble to predict with some confidence how the pri- 
mary sequences of other family members contrib- 
ute to secondary and super-secondary, if not terti- 
ary, structure. For relatively closely related family 
members, like AChE and butyiylcholinesterase 
(BChE), where overall sequence identity exceeds 
50%, the predictions can be extended to the point of 
identifying particular residues as candidates to ex- 
plain very specific differences in substrate speci- 
ficities. In some cases these predictions have been 
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validated by functional analyses of in vitro muta- 
tions of the candidate residues (e.g., Vellom et aL, 
1993). 

As with some of the other families of a/p hydro- 
lases, the carboxyl/cholinesterase group includes a 
few proteins without esterase activity. In this case 
three family members have been identified that 
have no known enzymic activity and lack the cata- 
lytic triad and Gly-x-Ser/Cys-x-Gly pentapeptide. 
These exceptional proteins are the mammalian 
hormone precursor thyreoglobulin and two otiier cell 
adhesion molecules, glutactin and neurotactin 
(Mercken et a/., 1985; Olson et al., 1990; De la 
Escalera et aiy 1990). All three show high sequence 
similarity to other family members in the amino 
terminal 300 residue stretch that includes most of 
the essential scaffolding of p strands in the ct/p 
hydrolase fold. While these p strands at least will 
be substantially confined to the protein interior, 
some loops between these strands must detour to 
the exterior to confer cell adhesion properties. This 
has been shown by functional aiudyses of various 
chimeric mutant proteins in which segments have 
been exchanged among glutactin, neurotactin and 
AChE (some forms of which are also associated 
with membranes) (Piovant et aL, 1993). 

Just as there are some members of the carboxyl/ 
cholinesterase family that have no known hydro- 
lytic activity, there are also others which have two 
such activities, specifically amidase as well as est- 
erase activities (Heymann, 1980; Richmond et aL, 
15>90, and references therein). Intriguingly, in the 
one case analysed mechanistically, a mammalian 
sterol esterase, the two activities appear to be at least 
partly independent (Hui, Hayakawa & Oizumi, 
1993). Both activities are susceptible to OP inhibi- 
tipn, suggesting a key Ser residue in the active site, 
although it has not been formally established that it 
is the same Ser in both cases. However, the His in 
the catalytic triad required for esteratic activity can 
be mutated to Ghi, abolishing esterase activity as 
expected but, surprisingly, leaving the amidase ac- 
tivity largely unaltered. 

The carboxyl/cholinesterase split 

Figure 2 shows a neighbor joining network (Saitou 
& Nei, 1987; Felsenstein, 1985) relating the in- 
ferred amino acid sequences of nine members of the 
carboxyl/cholinesterase multigene family. Four of 



them are from insects, four from humans and one 
from a fungus. Apart from the Drosophila and hu- 
man ACThEs, the network only includes genes that 
are clearly paralogous and the functions of which 
are known, at least in part. Human thyroglobulin is 
also included as a non-enzymic member of the fam- 
ily to provide an outgroup for the network. 

Ibx&t major splits are identified in the network. 
There is a primary separation of carboxyl and 
cholinesterases and then a further bifurcation 
within each of these two lineages. \^thin the for- 
mer the lipases are separated from other carboxy- 
lest^iises and within the latter the human represen- 
tatives are separated from the insect enzyme. Thus 
the functional difference between carboxyl and 
cholinesterases is reflected in their sequence diver- 
gence, regardless of the taxa from which they origi- 
nate. Furthermore, since both the carboxyl and 
cholinesterase lineages include insect and human 
genes, the carboxyl/cholinesterase split is likely to 
have preceded the divergence of vertebrates and 
invertebrates. Indeed, it could be even older, since 
the lipase lineage within the carboxyl esterases in- 
cludes both a fungal and a vertebrate enzyme. On 
the other hand, within the cholinesterases human 
AChE and human BChE share a more recent com- 
mon ancestry than either do with the insect AChE, 
which suggests a relatively recent acquisition of 
BChE function within the vertebrate line. Consis- 
tent with this relationship, no enzyme with obvious 
biochemical homology to BChE has been described 
from Drosophila^ or any other insect. 

Esterase genetics in D. melanogaster 

The previous sections have established a hierarchv 
of evolutionary relationships among various func- 
tional classes of esterases. In order to investigate 
the processes that have given rise to this pattern w o 
must first examine the genomic organisation of the 
cognate genes. The power of Z). melanogaster ge- 
netics suits it well for this purpose. Genes for a total 
of eight esterase activities have now been mapped 
in this species by classical genetics. These include 
three carboxyl and two cholinesterases and three 
whose status against the inhibitor-based criteria has 
not been determined. Several other putative est- 
erase genes have also been discovered during the 
course of sequence analyses of cloned DNA. Col- 
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lating the classical and molecular genetics indicates 
a minimum of 15 genes at five separate chromoso- 
mal sites, one of which contains two genes and the 
other at least ten (Fig. 3). 

The three genes so far attributed to unique sites 
produce enzymes with very diverse functions. 
These are: 

Ace^ which encodes the AChE isozymes, or sub- 
class I cholinesterases. Their preferred naphthyl es- 
ter substrate in vitro is a naphthyl acetate and their 
in vivo substrate is the neurotransmitter ace- 
tylcholme (Foumier et al, 1988, and references 
therein). Hybridisation of Ace clones to various 
chromosomal deficiencies localises it to 87E3 on 
the polytene map (3-52 genetically) of chromo- 
some niR (Spierer et al, 1983). 

£sri7, which encodes the EST17 isozyme, a sub- 
class n carboxyl esterase whose preferred naphthyl 
ester substrate in vitro is a naphthyl proprionate. 
The in vivo function of EST17 is unknown but it is 
largely confined to late larvae (Healy, Dumancic & 



Oakeshott, 1991). Mapping of allozymic variants 
against the rusteca chromosome in 107 test cross 
progeny places Estl7 roughly equidistant from e 
{3'lOn genetically) and ca (3-100.7), which puts it 
distal to Ace on chromosome niR (R Kostakos, 
R.J.R. & J.G.O., unpublished data). 

Est9 ofLoukas (1981). The EST9 isozyme en- 
coded by this gene hydrolyses a naphthyl acetate in 
vitro, but only in the presence of the peptidase 
substrate Heucyl p-naphthyl amide. The latter may 
have some mechanistic analogy to the joint est- 
erase-amidase activities of the mammalian sterol 
esterase (see above). However, in the absence of 
other inhibitor analyses, the status of this EST9 is 
uncertain against the inhibitor criteria. Mapping of 
allozymic variants puts the Est9 of Loukas (1981) 
on chromosome n and, assuming homology with a 
similar enzyme in Z). pseudoobscuray probably on 
nR (Loukas, 1981). 

An enzyme product has so far only beai identi- 
fied for one of the two genes in the smaller of the 
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Fig. J. Oiromosomal locations of D. melanogaster esterase genes. Bars illustrate chipmosomes H and HI, with the left (L) and right (R) 
arms indicated. Cytological locations are indicated above the chromosomes and genetic map positions below. Est9 of Loukas (198 1) has 
not been locaUsed within chromosome HR. Genes with known biochemical phenotypes are in bold; cloned genes for which no 
biochemical phenotypes have yet been identified are in normal typeface. 
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two esterase gene clusters. This pair includes: 
Est6^ which encodes the EST6 isozyme, an abun- 
dant and widely distributed subclass n carboxyl 
esterase whose preferred naphthyl ester substrate in 
vitro is P naphthyl acetate. Its in vivo substrate is 
unknown but physiological and behavioural data 
indicate a role in reproductive fitness (see below). 
In situ hybridisation of cloned Est6 to various chro- 
mosomal deficiencies localises Est6 to 69 A 1 on the 
polytene map (about 3-34 on the genetic map) of 
chromosome IDL (Oakeshott et al., 1987; Procu- 
nier, Smith & Richmond, 1991). 

EstP, which is downstreani from Est6 at 69A1 . It 
lies in the same orientation as Est6 and its initiation 
codon is just under 200 bp 3' of the Est6 termina- 
tion codon (Collet et al^ 1990). It is mainly tran- 
scribed in late larvae but bears no obvious corre- 
spondence with the developmental profile of any 
isozyme detected by Healy, Dmnancic and Oake- 
shott (1991). 

Four biochemically defined esterase activities 
have so far been mapped to the larger cluster of ten 
esterase genes. These four are functionally diverse, 
in that they include both carboxyl and choline- 
sterases on the inhibitor criteria. However, a com- 
mon feature of three of the activities at least is their 
homologies to various esterases implicated in OP 
insecticide resistance in other insects. The genes for 
these four activities are so tightly clustered that 
they have not been separable by deficiency map- 
ping. They are: 

Est9 of Healy, Dumancic and Oakeshott (1991) 
(= EstC of Beckman and Johnson, 1964), which 
codes for the abundant subclass n cholinesterase 
isozyme EST9 (ESTC). The preferred naphthyl sub- 
strates for this enzyme in vitro are a naphthyl pro- 
prionate and butyrate. Its in vivo substrate is unknown 
but the high concentration of the enzyme in the gut 
suggests a role in the digestion of dietary esters 
(see below). Localisation of the gene to 84D3-5 
(about 3-48 genetically) on chromosome niR was 
achieved by deficiency mapping of EST9 allozyme 
variants (Cavener, Otteson & Kaufman, 1986). 

Ali, which has not been related to a specific est- 
erase isozyme but which accounts for the majority 
of in vitro methyl butyrate hydrolysis in a manom- 
etric assay. The in vivo function of ALI is unknown 
but it shows strong biochemical similarity to the 
ALI activity which is greatly reduced in OP resis- 
tant mutants of the house fly Musca domestica (Op- 



penoorth & Van Asperen, 1960). The co-localisa- 
tion of Ali with Est9 above involved assays of ALI 
activities in deficiency heterozygotes (Spackman et 
al, 1993). 

Est23^ which encodes the microsomal subclass I 
carboxyl esterase isozyme EST23. The preferred 
naphthyl ester substrate of this isozyme in vitro is 
a naphthyl acetate. The in vivo function of EST23 
is unknown biit the enzyme shows very close bio- 
chemical similarity with the E3 enzyme which mu- 
tates to an apparentiy null phenotype in OP resis- 
tant mutants of die Australian sheep blowfly 
Lucilia cuprina (Spackman era/., 1993). Co-locali- 
sation of Est23 with Ali and Est9 was achieved by 
deficiency mapping of EST23 allozymes (Spack- 
man et a/., 1993). 

McCy which has not been related to any specific 
esterase isozyme but which encodes the ability to 
hydrolyse a small subset of OPs like malathion 
which carry carboxyl ester linkages in their alcohol 
group (in addition to the phosphoester linkage com- 
mon to all OPs). Its in vivo function is unknown but 
elevated MCE levels have been iiriplicated in ma- 
lathion resistance in several pest species (Ziegler et 
al, 1987; Russell etaL, 1990). Co-localisation witii 
Est9y Ali and Est23 mvolved assays of MCE activi- 
ties in deficiency heterozygotes (Spackman et a/., 
1993). 

Despite the differences in their biochenucal phe- 
notypes, the precise genetical relationships among 
Est9y Ali, Est23 and Mce are as yet uncertain. It is 
clear that Est9 and Est23 are each single and dis- 
tinct genes, because the allozymic variants they 
encode vary independentiy across strains (Spack- 
man et al^ 1993). There is also indirect evidence 
that Est23 is distinct from Mce, in that their homo- 
logues in L cuprina are separable by recombina- 
tion (albeit at very low ftequencies; Raftos & 
Hughes, 1986; Smyth et a/., in preparation). How- 
ever, in the absence of electrophoretic phenotypes, 
we cannot discoimt the possibility that the spectro- 
photometrically or radiometrically determined ALI 
and MCE phenotypes are each due to the products 
of more than one closely linked genes. For ALI in 
particular, it is possible that such genes include 
either or both of &/9 and Estl3. 

On the other hand, molecular data show that it is 
at least possible for all four of the biochemical 
phenotypes to be encoded by distinct genes. A total 
of ten putative esterase genes have been identified 
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from molecular studies of a 90 kb stretch of contig- 
uous DNA at 84D3-E2 (RJ.R., C. Robin, R 
Kostakos, R. Newcomb, L. Court, K, Medveczky, 
D. HartI, T.B. & J.G.O. unpublished data). Prelimi- 
nary analyses indicate that at least eight of the ten 
genes may be active and that they encompass part, 
if not aU of the Est9-Ali-Est23'Mce cluster identi- 
fied by the classical genetics. However, the precise 
correspondence of individual genes detected by the 
two approaches has not yet been elucidated. 

Whatever the details of its molecular basis, the 
clustering of iht Est9-AlUEst23'Mce genes is in- 
triguing in the context of the evolution of the car- 
boxyVcholinesterase multigene family, because 
EST9 is a subclass n cholinesterase and EST23 a 
subclass I carboxyl esterase. Two interpretations 
seem possible, given the phylogenetic distinction 
between at least some carboxyl and cholinesterases 
su^ested by the neighbor joining network in Fig- 
ure'' 2, One interpretation is that the cluster is an- 
cient and pre-dates the divergence of carboxyl and 
cholinesterases. Alternatively, there may have been 
more than one carboxyl/cholinesterase split within 
the family, perhaps one for the subclass I cholin- 
esterases revealed by the network in Figure 2 and 
one for subclass n cholinesterases represented by 
the Est9'Ali-Est23-Mce cluster. 



EvolBtioiiiL of esterase gene famalies across 
Drosopfdh specSes 

Having found evidence for two clusters of esterase 
genes in D. melanogaster, we now take a compara- 
tive approach to examine the evolution of the two 
clusters across Drosophila and other insect species. 
While not studied in the same detail as D. melan- 
ogaster, several other Drosophila species have also 
been surveyed for soluble esterases by electro- 
phoretic analysis, generally of whole fly homoge- 
nates (e,g., Johnson et al, 1966; Johnson, Ri- 
chardson & Kambysellis, 1968; Berger & Canter, 
1973; Mulley, James & Barker, 1979; Baker, 1980; 
Morton & Singh, 1985; Korochkin etal, 1987). As 
many as ten isozymes have been detected in some 
species, but a recurrent finding is the presence of 
two intensely staining non-AChE isozymes m ex- 
tracts fix)m a variety of life stages. 

In Z). melanogaster these isozymes are EST6 
from the small gene cluster above and the EST9 of 



Healy, Dumancic and Oakeshott (1991) from the 
larger cluster. However, the corresponding 
isozymes take other names in many of the other 
species. Generally, the two isozymes have been 
simply distinguished by theh in vitro preference for 
ot versus p naphthyl ester substrates (which gives 
rise to differendy coloured bands when coupled 
with several histochemical dyes). This o/p prefer- 
ence itself does not necessarily indicate a large 
biochemical difference; another minor isozyme of 
Z). mojavensis is actually polymorphic for a- and 
p-preferring forms (Zouros & Van Delden, 1982). 
In the case of the two major isozymes however, it 
is consistently associated throughout the genus 
with a number of other distinct biochemical and 
physiological properties. 

The Qrcsterase cluster 

The major a-preferring isozyme has been char- 
acterised biochemically from members of the 
melanogaster and obscura groups (mainly D. 
melanogaster and D, pseudoobscura, respectively) 
in the subgenus Sophophora and the virilis (mainly 
Z). virilis) and repleta groups (Z>. mojavensis, D. 
buzzatii) in the subgenus Drosophila (Narise, 1973; 
Sasaki & Narise, 1978; East, 1982; Morton & 
Singh, 1985). 

The eiizyme has been at least partially purified 
(over 150-fold in some cases) by various size frac- 
tionating chromatographic procedures in all four 
lineages and proves to be a monomer of 55-70 KDa 
in all cases. In crude homogenates the enzyme from 
representatives of each lineage is highly sensitive to 
OPs and somewhat sensitive to sulfhydryl reagents 
and eserine sulfate, chissifying it as a subclass n 
cholinesterase on the inhibitor criteria (Morton & 
Singh, 1985; Healy, Ehunancic & Oakeshott, 1991; 
M.M, Dumancic, M.J.H. & J,G.O., unpublished 
data). One distinctive property of the enzyme in all 
four lineages is that it is strongly stabilised by 
p-mercaptoethanol, suggesting that its sensitivity 
to sulfhydryl reagents represents an unstable disul- 
fide bond (Morton & Singh, 1985; M.M. Duman- 
cic, M.J.H. & J.G.O., unpublished data). Another 
shared property is a widespread distribution across 
tissues but major concentration m gut tissue 
throughout the life cycle (Kambysellis, Johnsoft & 
Richardson, 1968; Sasaki, 1974; East, 1982; Mor- 
ton & Singh, 1985). The enzyme is also weakly 
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associated with membranes in several of the species 
(Korochkin, Matveeva & Kerlds, 1973; M.M, 
Dumancic, MJ.H. & J.G.O,, unpublished data). 
Thus, in terms of its biochemistry and physiology 
at least, there is a good case for the orthology of the 
major a-preferring isozymes across the four line- 
ages. 

Although not inconsistent with this conclusion, 
the one immunological study of the enzyme to date 
does show substantial epitope divergence among 
the species groups. Thus, Saski (1975) found that 
polyclonal antibody to the major a-preferrmg en- 
zyme of D. virilis crtiss-reacted with other species 
within the virilis group, but not outside it 

Consistent with the biochemistry and physiol- 
ogy, classical genetic analyses of allozyme variants 
also implies orthology of the major a-preferring 
isozyme across the four species groups. The 
structural gene Est9 maps to chromosome HIR in 
D. melanogaster (see above) and the structural 
genes for the major a-esterases in thfe other species 
analysed map to. the homologous chromosomes/ 
arms (Triantaphyllidis & Christodoulou, 1973; 
Tsuno, Aotsuka & Ohba, 1984; Morton & Singh, 
1985; Schaferer aL 1993), 

We have shown in the previous section that Est9 
of D. melanogaster lies in a cluster of about ten 
esterase genes. The three other enzyme activities so 
far traced to this cluster; ALI, EST23 and MCE, are 
significantly different from EST9 and its putative 
orthologues in the other species in terms of their 
electrophoretic mobilities and substrate and inhib- 
itor specificities. However, where information is 
available, they do resemble EST9 m being 
a-esterases associated with gut tissues and mem- 
branes. Likewise, there is evidence for additional 
a-esterase genes clustered around the putative Est9 
orthologue in the virilis group of species (Baker, 
1980; Korochkin et aL, 1987). None of these en- 
zymes were tested for ALI or MCE activities but 
one appears similar to EST23. On the basis of the 
virilis data, Korochkin et aL (1987) proposed the 
existence of an a-esterase gene cluster. We suggest 
that it is orthologous to the Est9-Ali'E$t23'Mce 
cluster in £>. melanogaster. The constitution of the 
two clusters may not be identical but multiple 
members of both clusters are membrane-associated 
gut a-esterases. 

There is also evidence for such a cluster in an- 
other dipteran, L cuprina. The E8 isozyme of L 



L cuprina shows strong similarities to EST9 of 
D. melanogaster both in vitro, in terms of its elec- 
trophoretic phenotype, and in Wvo, in its develop- 
mental profile and tissue localisation (Parker et aL, 
1991). E8 is monomorphic clectrophoretically, 
so has not been mapped genetically. However, 
three other esterases have been mapped to a IcM 
region on the L cuprina homologue of chromo- 
some niR that contains the Drosophila a-esterase 
cluster (Raftos & Hughes, 1986; Smyth et aL, in 
preparation; see also Lai-Fook & Smith, 1991). 
Two of these enzymes, E3 and MCE, are clearly 
homologous on biochemical and physiological cri- 
teria to EST23 and MCE witiim the A melanogas- 
ter cluster (Ruasell et aL, 1990; Parker etaL, 1991; 
Spackraan et aL, 1993; Smyth et aL, m prepara- 
tion). The other enzyme, E9, has no obvious homo- 
logue in D. melanogaster but it is interesting that it 
is another microsomal a-esterase (Parker et aL, 
1991; Smyth et aL, in preparation). 

We predict that the E3, E8, E9 and Mce genes 
will be in the L. cuprina homologue of the array of 
ten cloned genes to which we ascribe the a-esterase 
cluster of D. melanogaster. The high level of con- 
servation of some a cluster genes implied by the 
Lr cuprina comparisons is also evident in a neigh- 
bor joining network of aU insect carboxylesterases 
for which full sequence data are available. This 
network (not shown) indicates a close relationship 
between enzymes conferring OP insecticide resis- 
tance on mosquitoes and the EST9-EST23-ALI- 
MCE cluster in D. melanogaster, suggesting that 
esterases involved in OP resistance may be 
homologous across several Diptera. This network 
also shows a clear separation between the a and 
the p cluster of Z>. melanogaster described below. 

The ^-esterase cluster 

As with the a-preferring isozyme, the major p-pre- 
ferring esterase has been characterised biochemi- 
cally from representatives of the virilis, repleta, 
melanogaster and obscura species groups (Narise & 
Hubby, 1966; Narise, 1973; Sasaki & Narise, 1978; 
Mane, Tepper & Richmond, 1983; East, 1984; Pen, 
Rongen & Beintema, 1984; Pen, Van Beeumen & 
Beintema, 1986; Morton & Singh, 1985; White, 
Mane & Richmond, 1988; Fanner & Carter, 1989). 
For each group this includes purification, at least 
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partially, and in some cases to homogeneity (entail- 
ing about 200-300 fold purification, depending on 
the species). 

The major p-esterase is a homodimer variously 
estimated at 100-140 KDa in the great majority of 
species investigated. However, it is a monomer of 
half the molecular weight in three species, includ- 
ing A melanogaster, within the melanogaster com- 
plex of the melanogaster group (Morton & Smgh, 
1985). That the two forms of the enzyme are never- 
theless orthologous is supported by the cross-reac- 
tivity of antibodies raised against the enzyme from 
D, melanogaster with the dimeric enzyme found in 
some of the other species (D. Morris & R.C. 
Richmond, cited in Oakeshott, Healy & Game, 
19W). Consistent with this, D. pseudoobscura, in 
the obscura group, usuaDy has a dimeric form of the 
enzyme but monomeric variants also occur (Ar- 
nason & Chambers, 1987). Across all species studied 
the two forms of the enzyme also have qualitatively 
sunilar inhibitor sensitivities, being sensitive to 
eserine sulfate and OPs, but not sulfhydryl rea- 
gents. Although polyclonal antibodies against the 
enzyme from virilis, repleta and melanogaster spe- 
cies all show wide cross-reactivity with the other 
sp^ies groups they do not cross-react widi their 
naajor ot-preferring enzymes, and in the case of 
vkilis the converse also holds (Sasaki, 1975; Pen, 
Van Beeumen & Beintema, 1986; D. Morris & 
R.C. Richmond, cited in Oakeshott, Healy & 
Game, 1990). Thus, aO available biochemical and 
immunological evidence suggests that the major 
3-preferring isozymes in the different lineages are 
orthologous to each other. These data also support 
the nei^bor joining network above in indicating a 
clear distinction from the major a-preferring en- 
zymes. 

Unlike the a-enzymes, some clear evolutionary 
differences emerge in a comparison of the stage and 
tissue distributions of the major 3-preferring en- 
zymes across species. The enzyme is found in the 
hemolymph of all the species so far investigated but 
some additional sites of expression prove to be 
phylogenetically restricted. Two examples are male 
reproductive tract expression in some of the melan- 
ogaster species complex and eye expression in 
Z). pseudoobscura (Morton & Singh, 1985; Brady 
& Richmond, 1990; Oakeshott, Healy & Game, 
1990). 

Two further complications for the orthology ar- 



gument arise from a comparison of the genetics of 
the major p-preferring enzyme across species. 
First, the structuml gene is loosely linked to the 
a-esterase cluster and on the same chromosome 
arm in all the virilis and repleta species analysed, 
but lies on a different arm in the melanogaster and 
obscura -species studied (JUL in Z). melanogasten 
Morton & Smgh, 1985; Korochkin et al, 1987,* 
Oakeshott, Healy & Game, 199(); Schafer et aC 
1993). Second, while the gene lies in another clus- 
ter of esterase sequences in all. four lineages, the 
organisation of that cluster appears to vary substan- 
tiaUy among the lineages. Current knowledge of the 
cluster, termed the p-esterase cluster by Korochkin 
et al. (1987), can be summarized as follows. 
For the melanogaster group: As outlined in the 
previous section, EST6 in D. melanogaster is ex- 
pressed in both hemolymph and male reproductive 
tract Molecular analysis shows the gene to be 
the 5' member of a tandem pair of esterase se- 
quences separated by about 200 bp (CoUet et al, 
1990). Amino acid sequences inferred for the prod- 
ucts of the two genes are 59% identical. No electro- 
phoretic phenotype has been identified for the 3' 
member of the pair {EstP) but transcript analysis 
shows its expression to be largely confined to late 
larvae. 

For the obscura group: A 12 kb genomic frag- 
ment containing three esterase sequences has been 
isolated from D. pseudoobscura tasing D. melan- 
ogaster Est6 as a probe (Brady, Richmond & Oake- 
shott, 1990; Brady & Richmond, 1990, 1992). 
Amino acid identities inferred for the three pre- 
sumptive gene products lie between 65% and 81% 
and their identities with the melanogaster pair lie 
between 61% and 70%. The central gene is ortholo- 
gous to Est6 and encodes the major hemolymph 
P-esterase. Products and expression patterns have 
not been determined for the two flanking genes. 

For the virilis group: Classical genetic analysis 
of A virilis shows a cluster of three p-esterase 
genes, encoding the major hemolymph enzyme, a 
slower migratmg form mainly expressed in male 
reproductive tract and a faster form mainly ex- 
pressed in pupae (Korochkm a/., 1987). A 15 kb 
clone proposed to contain two of these genes has 
also been isolated (Enikolopov et aL, 1989). Se- 
quence analysis puts die two presumptive esterase 
genes in this clone about 500 bp apart, while 
genomic Southern analysis suggests the existence 
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of a third, closely related gene somewhere outside 
this 15 kb. Inferred gene products for the two 
cloned genes are about 50% identical to the melan- 
ogaster and obscura genes above. Curiously, both 
inferred gene products deviate significantly from 
other carboxyl/cholinesterases at several sites that 
are otherwise highly conserved in this family (Ser- 
geev et aL, 1993; M.Z. Ludwig, N.A. Tamarina & 
R.C. Richmond, pers. conun.). Also, dieir expres- 
sion patterns are very different from those of 
Est6IEstP in D. melanogasten The 5' member of 
the cloned Z). virilis pair is known to encode the" 
male reproductive tract enzyme and the 3' member 
is proposed to encode the hemolymph enzyme 
(Korochkin et al, 1990; Sergeev et a/., 1993). 

For the repleta group: Classical genetic analysis 
of D. mojavensis shows a cluster of two genes, one 
encoding the hemolymph p-esterase and the other 
producing a slightly slower migrating isozyme in 
late larvae which is polymorphic for a- and p- 
preferring forms (Zouros et dl, 1982; Zouros & 
Van Delden, 1982). The N-terminal 34 residues of 
the two purified proteins are 82% identical and 
there is also strong similarity in their overall amino 
acid compositions O^en, Van Beeimien & Bein- 
tema, 1986). Several repleta group species also 
show a separate, slower migrating p-esterase 
isozyme in male reproductive tract (Kambysellis, 
Johnson & Richardson, 1968) but the cognate gene 
has not yet been mapped. Using Z). melanogaster 
Est6 as a probe, East, Graham and Whitington 
(1990) cloned a 16 kb genomic fragment from 
D. buzzatii that may contain at least two and proba- 
bly three esterase or esterase-like genes; sequence 
analysis of two of these revealed high similarity to 
the other esterase genes above (66% amino add 
id^tity with each other, 47-50% whh die melan- 
ogaster genes). Intriguingly, however, both the D. 
buzzatii sequences encode a Gly in place of the 
active site Ser, a change which would almost cer- 
tainly render the proteins catalytically inactive. 
Since D. buzzatii does, in fact, produce active he- 
molymph and late larval p-esterase isozymes, there 
may be two clusters of p-esterase or p-esterase-like 
sequences in this species. 

It is clear then that many aspects of the p-est- 
erase cluster are highly variable and rapidly evolv- 
ing, even within the genus Drosophila. Three prop- 
erties which do recur across the lineages examined 
are expression in hemolymph, male reproductive 



tract and late larvae. However, in the subgenus 
Sophophora the first two of these properties are 
apparently expressed by the one enzyme whereas in 
the subgenus Drosophila they are expressed by dif- 
ferent enzymes. Additionally, there are differences 
in the number of genes in the cluster, differences in 
the substrate specificities of the late larval enzyme 
(at least in vitro), the accimiulation of sequence 
motifs in the Z). virilis enzymes that are otherwise 
unusual in the carboxyl/cholinesterase multigene 
family, and the intriguing possibility of non-cata- 
lytic functions for the sequenced D. buzzatii genes. 

Molecular bases for esterase eyoluttion: cross- 
functional comnpaLrisons 

Having identified patterns of conservation and 
change in the evolution of esterase functions across 
genes and species in the last two sections, we now 
examine their molecular bases. To assess stractural 
changes we use the generic structure-function 
model for the carboxyl/cholinesterase multigene 
family oudined earlier. Where appropriate we also 
call on our knowledge of promoter functions to 
explain regulatory changes. We begin with two 
comparisons involving qualitative differeiices in 
esterase function, to be followed by two involving 
qualitatively conserved functions. 

EST6from D. melanogaster versus AChE 

Figure 4 and Table 3 (top row) show how the ge- 
neric model of Cygler et al (1993) for the super- 
secondary structures of carboxyl/cholinesterases 
applies in the specific case of EST6 from £>. melan- 
ogaster. The model allows specific supersecondary 
structures to be ascribed to about half of the EST6 
primary sequence. The particular structural features 
that we are able to identify are the active site gorge, 
substrate binding sites, salt and cysteine bridges, 
the two p sheets and the various a helices (see 
Thble 4 for precise coordinates). The model also 
suggests that many of the regions to which no spe- 
cific structural feature could be ascribed lie in 
hydrophilic regions on the protein surface; such 
hydrophilic/surface regions have proved to be 
amongst the most variable parts of many other pro- 
teins (C:hothia & Lesk, 1986). 
Figure 4 and Table 3 also relate the predicted 
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Structure for EST6 to the sequence differences 
between EST6 and T. califomica AChE, which is 
one of the crystallised enzymes on which the 
structural model is based. Sequence differences 
between EST6 and AChE are widespread across all 
the regions distinguished. The features showing 
least variation in Table 3 are the minor P sheet, the 
various salt andcysteine bridges, and die catalytic 
triad at the base of the active site gorge. The 
function of the minor p sheet is unclear but 
relatively strong conservation might be expected 
for the other two elements, because they will be 
critical for the maintenance of the o/p hydrolase 
fold structure (notwithstanding the relatively high 
variation in the major P sheet itself) and for 



esteratic activity. One notable difference involves 
the acid residue in the catalytic triad which is a Glu 
in AChE, but Asp in EST6. Some of the conserved 
elements identified in Table 3 correspond to 
troughs in the graphical plot of divergence against 
primary sequence in Figure 4, although the 
averaging approach in the latter tends to obscvire 
conservation at the level of individual residues. 

Two of the most divergent features identified in 
Table 3 are within the active site gorge and in sub- 
strate binding regions. Variation in the gorge is parti- 
cularly concentrated in the so-called aromatic guid- 
ance residues which are involved in the movement of 
substrate down the AChE gorge (Rippol et al, 1 993). 
AChE contains 14 of these residues, but EST6 con- 
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tains only six across all the residues that the model 
predicts would be in the gorge. The three substrate 
binding regions largely comprise a heUces and 
overall they are no less divergent than other a he- 
lices. In absolute terms Tkble 3 shows substrate 
binding region 3 to be the most divergent, but the 
graphical plot m Figure 4 also shows a marked con- 
trast between the divergence in region 1 and the high 
conservation of the primary sequence around it 

The differences between EST6 and AChE in the 
gorge and substrate binding regions would be ex- 
pected to contribute substantially to their differ- 
ences in substrate utilization. This prediction has 
been confirmed empirically for selected residues 
(Gibney et al., 1990; Myers, Healy & Oakeshott, 
1993). For example EST6 and AChE have His and 
Glu respectively at the residue immediately adja- 
cent the Ser nucleophile on its amino terminal side. 
A synthetic mutant of EST6 with Glu at this posi- 
tion acquires some activity for the diagnostic AChE 
substrate acetylthiocholine (Myers, Healy & Oake- 
shott, 1993). 

EST6from D. melanogaster versus EST5 from D. 
pseudoobscura 

Although there is good evidence that the two en- 
zymes are orthologous, the comparison of A 
melanogaster EST6 with EST5 from D, pseudoob- 
scura nevertheless entails a qualitative shift in the 
function of the enzyme. As outlined earlier, EST5 
generally exists as a homodimer whereas EST6 in 
A melanogaster is a monomer. Also EST5 shows 
little activity in the male reproductive tract but high 
levels of activity in eyes. Consistent with the im- 
plied functional change, there is a high level of 
replacement site sequence divergence between the 
two enzymes. Thie level is 4-5 fold lower than the 
paralogous EST6/AChE comparison above but, at 
23 % of the presumptively unconstrained silent site 
divergence (Brady, Richmond & Oakeshott, 1990), 
it is still very high compared to the values below 
10% obtaiued for most otiier ortiiologous compari- 
sons among Drosophiia species (Karotam, Delves 
& Oakeshott, 1993). While some of this replace- 
ment variation may be adaptively neutral, it also 
seems reasonable to conclude that some proportion 
is responsible for tiie differences in tiie function of 
the enzyme between £>. pseudoobscura and Z>. 
melanogaster. 



ThbU 4, The relationship between primary sequence and func- 
tional regions as used in TOIe 3. Nomenclature and identifica- 
tion of functional regions in AOiE are taken from Cygler et al, 
(1993). The EST6 nomenclature is based on the model of Cygler 
et al (1993) and the aUgnment of EAvP. See also Figures 4 
through 7. Gaps or structures not present are indicated by a dash. 
See T^ble 1 for identifi cation of beta sheets and alpha heUces. 

Gorge residue s Substrate binding 

AChE EST6 AChE EST6 



Catalytic triad 




Site 1 


Ser-200 


Ser-188 


71-86 


69-77 


Glu-327 


Asp-319 




Site 2 


His-440 


His-445 


251-264 


237-251 


Aromatic guidance 




Site 3 


Tyr-70 


Trp-68 


270-278 


260-268 




- 






IYp-114 


His-105 




Bridges 


Tyr-121 


Met-112 


Salt Bridges 


Tyr-130 


His- 121 


Aig-44 


Arg-41 






Glu-92 


Glu-82 


Trp-233 


IVp-221 


Aig-149 


Arg-138 






Asp-172 


Asp-160 


TVp-279 


Ai:g-266 






Phe-288 


Pro-275 


GIu-163 


Asp-151 






AiB-267 


i.ys-254 


Phe-290 


Ala-277 


Asp-397 


Asp-394 


Phe-330 


TyT-322 


Aig-517 


GIu-521 


Phe-331 


Asn-323 


Disulfide linkages 


TyT-334 


Lcu-326 


Cys-67 


Cys^ 






Cys-94 


Cys-84 


'IVp-432 


Ala-430 


Cys-254 


Cys-240 


•iyr-442 


Asp-447 


Cys-265 


Cys-252 




Other 


Cys-402 








Cys-521 




Asn-66 


Ala-64 




Cys-493 


Val-71 






Cys-514 


Asp-72 


Asp^9 






Ser-122 


Phe-113 






Glu- 199 


His-187 






Glu-273 


Glu-260 






Leu-282 


Leu-269 






Asp-285 


Ser-272 






Ser-286 


Tyr-273 






IIe-287 


Val-274 






IIe-439 


Val-444 






Glu-443 


Asp-448 






He-444 


Tyret49 







255 



This proposition is supported by an examination 
of the nature and location of the 138 amino acid 
differences between the D. melanogaster and 
D. pseudoobscura enzymes (Brady, Richmond & 
Oakeshott, 1990). Thus a relatively high proportion 
of these differences are physicochemically non- 
conservative (39% for charge and size, and 33% for 
polarity and hydrophobicity, on the criteria of Tay- 
lor, 1986). Likewise, an unusually high proportion 
of the differences (12%) occur in hydrophobic re- 
gions of a hydropathy plot (H > 0.5) which are 
likely to be in die interior of the protein where a 
change might have more radical effects on struc- 
ture. It is also noteworthy that the hydropathy plots 
for the two enzymes diverge substantially in the 50 
residues at their carboxy termini. There is one seg- 
ment in this region in particular where 8 of 11 
contiguous residues involve a charge difference. 

Reference to the structural model for carboxyl/ 
cholinesterases in Table 3 and Figure 4 shows that 
the overall distribution of differences between 
EST6 and EST5 is similar to that from the previous 
comparison of EST6 and AChE. Two relatively 



conserved elements in the EST6/EST5 comparison 
are the catalytic triad and the salt and cysteine 
bridges, as they were in tixe EST6/AChE compari- 
son. Conversely, the active site gorge and substrate 
bindmg region are again the most divergent re- 
gions. As with the EST6/AChE comparison, this is 
consistent with a qualitative shift m esterase func- 
tion between EST6 and EST5. Significantiy, how- 
ever, the nature of the differences between EST6 
and EST5 in the gorge residues in particular is not 
die same as those between EST6 and AChE. For 
example, while the proportion of aromatic residues 
in the gorge is much less in EST6 than AChE, it 
remains about the same for EST5 as compared with 
EST6. In fact, all aromatic residues are identical 
between EST6 and EST5 throughout the gorge. 
This similarity may exemplify an aspect of sub- 
strate utilisation which EST6 and EST5 share with 
each other but not with AChE. 

Two other structural features against which we 
can compare EST6 and EST5 are potential glyco- 
sylation sites and the signal peptide. EST6 is 
known to utilise four glycosylation sites and fur- 
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Fig. 5, Inferred EST6 sequences for D. melanogaster (mel), with differences in D. simulans (sim) and D, mauritiana (mau) shown 
below. Gaps are represented by dots and residues involved in the catalytic triad or glycosylation sites (asterisked) are indicated by arrows. 
The signal peptide and three cysteine bridges (C-Q are shown above the sequence. Data arc from Figure 2 of Karotam, Delves and 
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themore, the glycans are known to stabilise the 
protein in vivo (Myers, Healy & Oakeshott, submit- 
ted). Glycosylation has not been studied in EST5 
but we note that only two of the four glycosylation 
sites in the D. melanogaster protein are conserved 
in D. pseudoobscura and the latter also has one 
other potential glycosylation site not found in 
D. melanogaster. Over half the 21 residues in the 
signal peptide differ between the two proteins, al- 
though both retain the hydrophobicity characteris- 
tics associated with functional secretion signals. 
Again, the functional significance of secretion is 
not known for EST5 but it is critical to the repro- 
ductive tract function of EST6 (see below). 

As we noted earUer, the qualitative differences in 
structural aspects of the enzyme in £). melanogaster 
and D. pseudoobscura are associated with several 
qualitative differences in its expression. It might 



therefore be expected that promoter sequences, like 
the coding region, would also have diverged sub- 
stantially between the two species. This expectation 
is fulfilled. The promoter for Est6 extends about 1 . 1 
kb 5' of the translation start site (Ludwig, l^marina 
& Richmond, 1993) but the Est5 promoter is barely 
half this length, extending no further than 450 bp 5' 
of the coding region (Brady & Richmond, 1990). 
Moreover, sequence divergence is so great that the 
alignment across some segments of the two pro- 
moters is no better than chance expectations 
(Brady, Richmond & Oakeshott, 1990). In general 
terms the divergence is lower near the start of the 
coding region, which is consistent with results from 
a fine-scale dissection of the D. melanogaster pro- 
moter showing a greats concentration of specific 
promoter elements in this region (Ludwig, Tkmar- 
ina & Richmond, 1993). It is also consistent with 
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the finding that many of the elements required for 
the £>. melanogaster-specific ejaculatory duct ex- 
pression lie in the distal part of that species* pro- 
moter. Elements directing the Z), pseudoobscura- 
specific expression in the eye have not yet been 
localised 



Molecnilar bases for esterase evolution: within 
function comparisons 

EST6 in D. nielanogaster versus D. simulans and 
D. maudtiana 

Unlike- the two comparisons above, jel comparison 
of EST6 among the sibling species D. melanogas- 
tery D, simulans and D. mauritiana does not involve 
any qualitative shift in EST6 function. Extensive 
comparative data on the biochemistry and physiol- 
ogy of EST6 in these three melanogaster subgroup 
species reveal some quantitative differences in 
EST6 activity levels but no qualitative change in its 
tissue or temporal distribution (Morton & Singh, 
1985; Karotam & Oakeshott, 1993, and references 
therein). The enzyme also takes a monomeric stmc- 
ture in all three species (Morton & Singh, 1985; 
Oakeshott, Healy & Game, 1990). Consistent with 
the implied conservation of function, the rate of 
replacement site divergence between D. melan- 
ogaster EST6 and either of the other two species' 
enzymes is lower than the corresponding rate from 
the comparison between Z>. melanogaster arid 
Z). pseudoobscura above (14% versus 23% of the 
respective silent site rate; Karotam, Delves & 
Oakeshott, 1993). The 30 differences in the mature 
EST6 proteins of the three melanogaster species are 
also less radical in physicochemical terms than the 
differences between tiie Z>. melanogaster and D, 
pseudoobscura proteins (17% versus 39% non-con- 
servative for charge, 33 % versus 39 % for size, 23 % 
versus 33% for polarity and hydrophobicity, on the 
criteria of Taylor, 1986). Moreover, only one of 
the 30 changes (a conservative Val/He difference 
at residue 182) lies in a hydrophobic sequence 
(H > 0.5), whereas 12% of the differences between 
the D. melanogaster and D. pseudoobscura pro- 
teins lie in hydrophobic regions. 

Further evidence that the amino add differences 
in EST6 between the three sibling species may have 
relatively little effect on function comes from over- 
laying diem on the general model for carboxyl/ 



cholinesterase structure (Table 3, Figs. 5 and 6). 
Whereas the divergent residues in the cross-func- 
tional EST6/AChE and EST6/EST5 comparisons 
were widely distributed across structural elements, 
the differences in EST6 among the sibling species 
are now confined to certain structural features. 
Most notably, the conservation of the catalytic triad 
between EST6 and EST5 now extends to all resi- 
dues identified in the active site gorge and the triad 
residues are also embedded in invariant stretches of 
at least ten amino acids in the primary sequence. 
This suggests tight conservation of the catalytic 
mechanism across the three species. Other struc- 
tural features whose residues are invariant in this 
comparison include the minor P sheet and all the 
salt and cysteine bridges. The conservation of salt 
and cysteme bridges suggests conservation of 
broad stmctural and stability parameters of the en- 
zyme. 

Regions that remain variable among the three 
species are the substrate binding elements, the ma- 
jor p sheet, the a helices and the regions to which 
no specific structures are assigned by the model. 
The divergence in many of these elements may 
have little impact on function but some of the varia- 
bility in substrate bindmg sites 2 and 3 might be 
expected to affect substrate specificities. Whetiier 
this is true awaits empirical data comparing EST6 
substrate specificities across the three species. 

Two other elements of known function that also 
vary across the three sibling species involve a gly- 
cosylation site and the signal peptide (ICarotam, 
Delves & Oakeshott, 1993). Three of the four gly- 
cosylation sites found in D. melanogaster recur in 
the other two species and neither of the latter show 
any new sites. However the fourth site, residue 487 
in D. melanogaster^ is absent in D. simulans and 
D. mauritiana, Hiis difference could affect the in 
vivo stability of the enzyme, although we note that 
this site is also polymorphic in D. melanogaster^ so 
it is not a fixed difference between the species 
(Myers, Healy & Oakeshott, 1993). The hydropho- 
bic core of the signal peptide is also two residues 
shorter in Z). simulans and D. mauritiana than in 
D. melanogaster (ten versus twelve residues). This 
difference could affect the secretion of EST6, al- 
though both forms of the signal peptide are within 
the bounds expected to be functional. 

In contrast to the relatively high level of replace- 
ment site variation in the Est6 coding region, the 
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promoter shows relatively little variation among 
the three melanogaster subgroup species. Most of 
the tissue-specific elements in the D. melanogaster 
promoter lie in the first 300-400 bp 5' of the gene, 
although some elements controlling male reproduc- 
tive tract expression lie in the next 600-700 bp 
(Ludwig, Tamarina & Richmond, 1993, and see 
above). Preliminary analysis of the D. simulans and 
Z). mauritiana promoters in transgenic Z). melan- 
ogaster suggests that the qualitative similarities and 
quantitative differences in EST6 expression be- 
tween the species are encoded by the first LI kbof 
5' DNA (Karotam, Delves & Oakeshott, 1993). 
SequOTce divergence across the three species is 
quite limited in the proximal third of this segment 
(15% of the silent site rate for £>. melanogaster 
versus the other two species) but much less con- 
strained in the remainder (75 % ; Kaiotam, Delves & 
Oakeshott, 1993), Features of tiie proximal seg- 
ment that are absolutely conserved include the 
TATA box and two stretches of over 100 bp which 
contain elements directing expression in four dis- 
tinct tissues. The conservation of this proximal 
thud is consistent with the qualitative similarity in 
the temporal and tissue specificity of Est6 expres- 
sion across the three species. On the other hand, the 
divergence in the remainder of the promoter may 
explain the quantitative differences in EST6 activi- 
ties among the three species. 



Polymorphisms within D, melanogaster and D. 
simulans 

Before discussing the molecular bases of polymor- 
phisms in the structure and expression of EST6 in 
D. melanogaster and £). simulans, we first sum- 
marise the wealth of data on their classical popula- 
tion genetics. The major reason for the interest in 
these polymorphisms by population geneticists has 
been the physiological evidence that EST6 contrib- 
utes to fitness duough effects on reproductive be- 
havior. Most of the major pulse of EST6 expression 
in die anterior sperm ejacuktory duct of the adult 
male is transferred to the female during mating. 
Females mating with wild type males show greater 
oviposition behavior and slower receptivity to re- 
mating than do females mating with males ho- 
mozygous for a laboratory mutant lacking any 
EST6 activity (Richmond etaL, 1990, for a review; 



see also Miekle & Richmond, 1990; Myers, Healy 
& Oakeshott, submitted). 

Allozyme polymorphism 

D, melanogaster is polymorphic for two major al- 
lozymes, denoted EST6-F and EST6-S. These vari- 
ants show complementary latitudinal clines, such 
that EST6-S frequencies .tend to increase at the ex^ 
pense of EST6-F frequencies at higher latitudes. 
The relationship extends over 40 oC of latitude and 
is broadly consistent across different continents and 
both hemispheres (Oakeshott et al, 1981; but see 
also Jiang, Gibson & CJien, 1989). Consistent with 
these clines, weak but recurrent seasonal trends 
have also been recorded, showing EST6-S fi^quoi- 
cies to increase at die expense of EST6-F in cooler 
seasons (Franklin, 1981; Oakeshott, Wlson & 
Knibb, 1988). D. simulans is also polymorphic for 
EST6-F and EST6-S allozymes and the electro- 
phoretic mobilities of these variants are identical to 
the corresponding D. melanogaster allozymes un- 
der standard electrophoretic conditions. Moreover, 
parallel latitudinal clines are also found for the 
D. simulans variants (Anderson & Oakeshott, 
1984). All these data suggest that natural selection 
differentiates between EST6-F and EST6-S and that 
the molecular target and mechanism of the selection 
are shared between the two sibling species. 

The first major complication for this interpreta- 
tion comes fxom evidence for additional structural 
variants segregating within EST6-F and EST6-S. 
This variation is cryptic to the standard electro- 
phoretic procedures used to describe the clines but 
can be detected by thermostability analyses or 
higher^ resolution electrophoresis (Cochrane & 
Richmond, 1979; Cooke, Richmond & Oakeshott, 
1987). In D, melanogaster there are several rela- 
tively common variants within EST6-F but one 
form, denoted EST6-8, dominates in firequency 
within EST-S in all populations investigated (La- 
bate et al, 1989). In D, simulans several variants 
have been reported within botii EST6-F and EST6-S 
but none of tiiem correspond to EST6-8 (Albuquer- 
que&Napp, 1981; Karotam, Boyce& Oakeshott, in 
press). Two distinct selective processes have there- 
fore been proposed (Oakeshott et aL, 1989): one 
targets die overall EST6-F/EST6-S difference and 
explains the clines in both species, while the otiier 
distinguishes EST6-8 widiin EST6-S and accounts 
for its proliferation in D. melanogaster. 
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A second difficulty in further elucidating the se- 
lection among EST6 allozymes is the overall failure 
of about 40 attempts to detect the selection in labo- 
ratory populations of D, melanogaster (Oakeshott 
et al, 1989; Richmond etaU 1990, for references). 
While many studies have reported fitness differ- 
ences associated with the major EST6-F/EST6-S 
difference there is litde consistency in the effects 
seen across different studies. Notably few studies 
have specifically examined reproductive compo- 
nents of fitness that might be relevant to the physio- 
logical function of the enzyme in the ejaculatory 
duct; nevertheless the few that have, have failed 
(Saad et al, submitted). Importantly also, only two 
studies have distinguished the minor variants segre- 
gating within EST6-F and EST6-S, but neither 
found any evidence for a fitness advantage to 
EST6-8 (Saad et ai, submitted; Oakeshott et al, 
submitted). Notwithstanding the methodological 
problems with some of the studies, the overall con- 
clusion must be that the selection among EST6 
allozymes mferred from the field data is either in- 
operative, or too weak to detect, in the laboratory. 

Amino acid polymorphism 
Molecular analyses of the various EST6 allozymes 
can explain some of these complications and anom- 
alies but they also radically change our interpreta- 
tion of the selection inferred firom the field data on 
allozyme frequencies. Seventeen isolates of the 
Est6 gene covering fourteen of the allozymes de- 
tectable by high resolution electrophoresis have 
now been sequenced in D. melanogaster and 
£>. simulans (Cooke & Oakeshott, 1989; Karotam, 
Boyce & Oakeshott, in press). The high level of 
amino acid polymorphism revealed puts EST6 
among the most polymorphic isozymes yet char- 
acterised at a molecular level in any species. On 
average, any two of the EST6 aUozymes ftom 
D, melanogaster differ by about four amino acids, 
and the corresponding figure for D. simulans is 
seven. Several of the amino acid polymorphisms 
prove to have no mobility phenotype even under 
high resolution electrophoresis or isoelectric fo- 
cussing (RH. Cooke & J.G.O., unpublished). 
Clearly some of the anomalies in the laboratory 
fitness comparisons among Est6 genotypes could 
simply reflect the use of different EST6 variants. 

Surprisingly, no single amino acid polymor- 
phism is invariandy associated witii the EST6-F/ 



EST6-S difference across the thirteen sequenced 
isolates from D. melanogaster which have been 
sequenced. However, most of the EST6-F group of 
allozymes can be distinguished from most of the 
EST6-S group by two amino acid polymorphisms 
in tight linkage disequilibrium with each other, 
Asp/Asn-237 and Thr/Ala-247 (with the EST6-F 
amino acid given first). We suggest that the EST6- 
F/EST6-S clines in D, melanogaster are an imper- 
fect reflection of selection on either or both of these 
amino acid differences. Of these two, die Asp/Asn- 
237 would be the best candidate for selection, 
partiy because it involves a charge difference that 
presumably causes the electrophoretic mobility dif- 
ference, and also because its association with the 
EST6-F/EST6-S difference is slightiy stronger than 
is that for Thr/Ala-247 (see below). 

The data set for £>. simulans only includes four 
isolates but this is sufficient to show that EST6-F 
and EST6-S in this species are not distinguished by 
the same polymorphisms as those most strongly 
associated with the electrophoretic difference in 
/>. melanogaster. Jn D. simulans the two polymor- 
phisms distinguishing tiie EST6-F and EST6-S se- 
quences are Thr/Asn-237 and Asp/Val-487, the 
only overlap with the two in Z>. melanogaster 
above being tiie Asn-237 in EST6-S. Therefore, the 
earlier conclusion of a common target and mecha- 
nism of selection in the two species can only be 
retained by proposing that selection would recog- 
nise Asp-237 and Thr-237 in the two species* 
EST6-FS as differing in the same way from Asn- 
237 m the two EST6-Ss. This does not seem hkely 
for two reasons. Firstly, it seems unlikely on phys- 
icochemical grounds, since Asp is charged but Thr 
not Secondly, since only the Asp/Val-487 of the 
two Z). simulans polymorphisms involves a charge 
difference, only this 487 polymorphism is Ufcely to 
be causally connected with the electrophoretic mo- 
bility difference. Given the distance between the 
two polymorphisms and the fact that only four 
D, simulans sequences were sampled, the associa- 
tion of Thr/Asn-237 with die EST6-F/EST6-S may 
not even hold up in the wider population. 

Another revision to our thinkhig caused by the 
molecular data concerns the EST6-8 allozyme that 
is conunon in D. melanogaster but absent from 
D, simulans. On the assumption that the EST6-F/ 
EST6-S difference has the same molecular basis in 
the two species, the proliferation of EST6-8 within 
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EST6-S in D. melanogaster was originally attrib- 
uted to the spread of a different amino acid poly- 
moiphism from those underiying the clinal selec- 
tion between EST6-F and EST6-S. Since the as- 
sumption about EST6-F/EST6-S is almost certainly 
wrong, there is no need to propose that the demog- 
raphies of the EST6-8 and EST6-S variants in 
Z). melanogaster reflect the demographies of differ- 
ent amino acid polymorphisms. Indeed, given that 
EST6-8 is by far the most common allozyme within 
the EST6-S group, they probably reflect effects on 
the same amino acid polymorphisms. 

One of die origmal ideas about EST6-8 that is 
supported by the molecular data is the notion that it 
has arisen and spread relatively recenriy. Sequenc- 
ing of two Est6-8 isolates from different continents 



reveals no replacement or silent site differences 
between them, whereas replicate isolates for two 
other aliozymes differ at an average of seveasuch 
sites. Importantly however, the molecular data also 
show that Est&S is just a representative of a small 
group of four haplotypes that are closely related to 
one another (average of three differences in re- 
placement and silent sites) but distantly related to 
the eight other haplotypes detected (average of 
twelve differences). The EST6-8 group contains 
three EST6-S haplotypes and a rare faster allozyme 
(EST6-F'), while the other group includes all the 
EST6-F forms, another rare very fast mobility form 
(EST6-VF), and the fastest mobility form within 
EST6-S, The only polymorphism invariably associ- 
ated with the two groups of haplotypes is Asp/Asn- 
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237 but Thr/Ala-247 and several silent polymor- 
phisms less than 300 bp away are also strongly 
associated with the two haplotype groupings (the 
only exception for 247 being Ala-247 in one of the 
EST6-8 group of haplotypes). While the 237 poly- 
morphism thus remains die most likely target for 
the selection inferred from the EST6-F/EST6-S 
clines and the proliferation of EST6-8, the response 
to selection has nevertheless involved a block of 
several closely linked polymorphisms. It is at least 
a formal possibility that some of the silent polymor- 
phisms could influence phenotype and fitness via 
effects on the regulation of the EstP gene immedi- 
ately downstream. 

Functional effects of the amino acid polymor- 
phisms 

How then do the various amino acid polymor- 
phisms in Z). melanogaster and P. simulans relate 
to the structural model for carboxyl/choline- 
sterases? As was the case with the interspecific 
comparisons among Z). melanogaster^ D. simulans 
and D, mauritiana (Fig. 7, Table 3), the active site 
gorge, salt and cysteine bridges and the minor ^ 
sheet are essentially invariant (see below for one 
exception). Thus, these structures are divergent in 
comparisons among enzymes with qualitatively 
different functions (i.e., EST6/AChE and EST6/ 
EST5) but conserved in comparisons of enzymes 
with qualitatively similar functions (i.e., EST6 
within and among the sibling species). On the other 
lulnd, there is also relatively greater conservation in 
the major 3 sheet and many of the a helices than 
was evident in the comparisons between the sibling 
species. The vast majority of the polymorphism lies 
either in substrate binding sites or in regions to 
which no particular structures are ascribed in the 
model. 

In fact, only seven out of the total of 32 polymor- 
pljisms lie in structures to which specific functional 
effects can be attributed, four in substrate binding 
site 2, two in a glycosylation site and one in a 
cysteine bridge. The latter, a Cys/TVr-514 polymor- 
phism in D. melanogaster, is the cause of an ex- 
tremely rare, smeary electrophoretic phenotype that 
was specifically chosen for sequencing because of 
this phenotype but which we have only ever noticed 
in a single field strain. The other six of these poly- 
morphisms are all relatively common. There are 
three of them in each species and, significantly, 



they include both polymorphisms most strongly 
associated with EST6-F and EST6-S in each spe- 
cies. Because two of the sites for these six polymor- 
phisms are shared across the two species, the six 
differences only involve four sites, 237, 243, 247 
and 487. 

The Asp/Asn-237 and Thr/Ala-247 polymor- 
phisms which are most strongly associated with the 
EST6-F/EST6-S difference in D. melanogaster 
both lie in substrate binding site 2. Asp/Asn-237 
produces the charge differ^ce that presimiably 
causes the electrophoretic mobility difference 
while Thr/Ala-247 is also non-conservative for po- 
larity. Some effects on substrate interactions would 
therefore seem possible for both polymorphisms, 
although they are arguably more likely for the 
charge-non-conservative Asp/Asn-237. Differ- 
ences in kinetic parameters related to substrate in- 
teractions and in substrate and (competitive) inhib- 
itor specificities have indeed been reported between 
purified EST6-F and EST6-S (Danford & Beard- 
more, 1979; White, Mane & Richmond, 1988). 
This further supports the proposition that the 237 
and/or 247 polymorphisms are targets for the natu- 
ral selection that underlies the clines for EST6-F/ 
EST6-S (and the rapid proliferation of EST6-8). 
The third common polymorphism in D. melanogas- 
ter that the structural model suggests might affect 
function is Ser/Ala-487, which results in the pres- 
ence/absence of the fourdi and final glycosylation 
site in the EST6 primary sequence. As explained 
earlier, EST6 mutants engineered to lack this plus 
its three other glycosylation sites are less stable in 
vivo. Consistent with this, the Ser/Ala-487 differ- 
ence is associated with minor electrophoretic and 
thermostability variation within both EST6-F and 
EST6-S (Cooke & Oakeshott, 1989). 

As in D. melanogaster^ two of the three conunon 
Z). simulans polymorphisms adjudged on structural 
groxmds most likely to affect function lie in sub- 
strate binding site 2 and the third lies in the fourth 
glycosylation site above. In Z>, simulans the bind- 
ing site polymorphisms are Thr/Asn-237 and Ala/ 
Ser-243, the first of which is associated with the 
EST6-F/EST6-S difference. Thr/Asn-237 is non- 
conservative for hydrophobicity while Ala/Ser-243 
is non-conservative for polarity and hydrophobi- 
city. Effects on substrate interactions therefore 
seem possible for both, although we are unaware of 
any empirical data to test this possibility. The third 
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Z). simulans polymorphism in this group is Val/ 
Asp-487, which produces the charge difference 
assumed to underlie the EST6-F/EST6-S electro- 
phoretic mobility difference in this species. Resi- 
due 487 aligns with the glycosylation site in 
a melanogaster EST6 but neither amino acid at 
this position in the D. simulans protein would in 
fact produce a glycosylation site. However, Val/ 
Asp-487 is non-conservative for size and hydro- 
phobicity as well as for charge. That such a phys- 
icochemically radical difference occurs at a residue 
where glycosylation affects the function of the D, 
melanogaster protein suggests the possibility of 
functional effects in £>. simulans EST6, albeit ones 
we cannot yet specify. 

Promoter polymorphism 

Although the analyses are less advanced than for 
the structural region, early data suggest that regula- 
tory polymorphisms also contribute substantially to 
EST6 phenotypic variation within D. melanogaster 
and D. simulans. Heritable 2-3 fold differences in 
^max have been reported among 42 

third chromosome isoallelic lines extracted from a 
natural population of D, melanogaster (Game & 
Oakeshott, 1989). Differences of a similar order 
have also been recorded in seventeen of these lines 
scored for EST6 V^^^^ m first instar larvae and 
mid-pupae of this species (Oakeshott et aL, submit- 
ted). Only a small minority of the variation in any 
of the measures can be explained by differences 
among the six EST6 allozymes that high resolution 
electrophoresis shows to segregate among the lines. 
More of the activity variation is associated 
with restriction fragment length polymorphisms 
(RFLPs) located 5' of the gene. 

More specifically, all significant between-line 
variation in larval and pupal activities is associated 
with polymorphism for the insertion of a transposa- 
ble element about 1,4 kb 5' of the gene (Oakeshott 
et al, submitted). We have seen that most of the 
Est6 promoter lies closer to the gene, so this effect 
may be an indirect one, due to interference in the 
promoter*s functions during transcription of the 
transposable element. No 5' RFLPs have been re- 
lated to the variation in adult female activity levels 
but about 20% of the differences between lines in 
male activity levels is associated with a Rsal poly- 
morphism about 540 bp 5' of the coding region 
(Game & Oakeshott, 1990). Promoter elements di- 



recting Est6 expression in adult males are known to 
lie in or near the Esal site. However, without full 
sequence data it is not possible to determine 
whether the Rsal polymorphism is causally related 
to the activity variation, or simply in linkage dise- 
quilibrium with other promoter polymorphism(s) 
which are more directly involved. 

Although only seven D. simulans lines have been 
compared for EST6 V^, high levels of variation 
among the lines are evident in both adult males and 
females (Karotam & Oakeshott, 1993). Hie Est6 
promoter has also been sequenced for four of these 
lines (Karotam, Boyce & Oakeshott, in press). Al- 
though the data set is too small to relate the activity 
differences to particular promoter polymorphisms, 
some broad patterns are manifest in the distribution 
of the nucleotide variation along the promoter. 
Polyinorphism is extremely low in the proximal 
tiiird of tiie 1 . 1 kb of 5' DNA sequenced (6% of die 
silent site level in the coding region), but much 
higher in the remaining two thinls (41% of the 
silent site level). This distribution closely parallels 
the pattern for the same region we saw earlier in the 
interspecific comparisons among D. melanogaster^ 
D. simulans and D. mauritiana. However, as with 
these earlier comparisons, the localisation of the 
particular sequence differences causing the activity 
variation will await more detailed dissection of 
Est6 promoter function. 



Gomiclusionis 

The aim of this essay has been to reconstmct some 
of the macro- and micro-evolutionary processes 
shaping the diversity of esterase enzymes found in 
higher organisms. The focus of our attention is 
Drosophila, D, melanogaster in particular, because 
of the wealth of data on its biochemical, molecular 
and evolutionary genetics. However, our interpreta- 
tion of these data relies heavily on functional classi- 
fication schemes and structure-function models 
origiiially developed with vertebrate and microbial 
esterases. 

About 30 distinct esterases have been detected in 
D. melanogaster by electrophoretic analyses and 
several others have been identified from this spe- 
cies by spectrophotometric methods. Most can be 
classified as carboxyl, choline or acetyl esterases on 
a set of inhibitor-based criteria originally devel- 
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oped for mammalian esterases. A fourth class 
found in mammals, the aryl esterases, has not yet 
been found in D. melanogaster, although it is not 
clear that the assays used with Z). melanogaster 
have been appropriate to detect such a class. The 
preponderance of carboxyl and cholinesterases in 
both mammals and D. melanogaster contrasts with 
the finding that most prokaryotic esterases are aryl 
or acetyl esterases or, more often, not classifiable 
against the inhibitor criteria. Thus, esterases can be 
partitioned into several families on in vitro bio- 
chemical criteria but there is only limited overlap 
between the families that have proliferated in 
eukaryotes versus prokaiyotes. 

Sequence data for over fifty cloned esterase 
genes establish that at least some of these biochem- 
ically defined families represent distinct phylogen- 
etic lineages as well. There are significant similari- 
ties between many of the eukaryotic carboxyl 
esterases and all the eukaryotic cholinesterases se- 
quenced to date, so these enzymes are all placed in 
a single carboxyl/cholinesterase multigene family. 
This family encompasses all the Drosophila 
esterases so far sequenced There is minimal se- 
quence similarity between the carboxyVcholinest- 
erase and any other esterases but crystal structures 
for selected enzymes reveal striking similarities 
with some other manmialian carboxyl esterases, 
mainly Upases, and some bacterial aryl esterases! 
This suggests that a high proportion of eukaryotic 
esterases and at least some prokaryotic esterases 
share a common, albeit ancient, ancestry. All the 
enzymes that show the characteristic tertiary struc- 
ture, or bear sequence similarity to enzymes with 
this structure, are placed in a superfamily, called 
the o/p hydrolases. The affinities of the remaining 
enzymes, mostly aryl or acetyl esterases or un- 
classifiable on the inhibitor criteria, are generally 
unknown, although a few of the unclassifiable en- 
zymes show sequence similarity with serine pro- 
teases. 

About fifteen esterase genes have been mapped 
by classical or molecular methods to the D. melan- 
ogaster genome. Five separate chromosomal sites 
are implicated, with two genes so far identified at 
one site, and about ten at another. The latter cluster, 
termed the a-cluster, includes both carboxyl and 
cholinesterases, so its origin may pre-date the car- 
boxyl/choUnesterase split. Certain members of this 
family are tighdy conserved across diverse Droso- 



phila species and preliminary molecular work also 
suggests orthology with esterases implicated in in- 
secticide resistance in several other insects. By con- 
trast, the other, smaller cluster, seems to be evolv- 
ing more rapidly. Both genes in this cluster, termed 
the p-cluster, are found in other members of the 
subgenus Sophophora, although the expression of 
one gene varies qualitatively in D. pseudoobscura 
and that species' p-cluster also contains a third 
esterase sequence. Evidence to date suggests that 
the number, organisation and expression of genes 
in the p-cluster all vary in more distantly related 
species fiom the subgenus Drosophila and there is 
indirect evidence for two p-clusters in some of 
those species. 

We have used the EST6 carboxyl esterase jfrom 
the p-cluster to mvestigate the molecular basis for 
fimctional evolution within the carboxyl/cholinest- 
erase multigene family. Preliminary modeUing 
shows good agreement with the super-secondary 
structure for a vertebrate cholinesterase, AChE. 
whose crystal structure has been solved. However] 
primary sequence differences between the two en* 
zymes are widespread throughout the super-secon- 
dary structure. Divergence is lowest around salt and 
cysteine bridges that stabilise the overall ot/p hy- 
drolase fold structure. On the other hand, individual 
residues in the a helices and p strands that make up 
this overaU structure are quite variable. Indeed 
these structures are no less variable than regions 
largely expected to be on the protein surface to 
which no particular structures are ascribed by the 
model. High levels of divergence are also evident in 
substrate binding regions and the active site gorge 
and these may be critical to the fimctional differ- 
ences between the two enzymes. In particular, most 
of the aromatic guidance residues in the active site 
gorge of AChE do not occur in EST6. 

Comparisons of EST6 from D. meUmgaster 
with the orthologous EST5 ftom D. pseudoobscura 
also involve a qualitative change in function, die 
former being mainly expressed in male reproduc- 
tive tissue and the latter in the eye. This difference 
in function is again associated with a high level of 
sequence divergence; although the two species lie 
in the same subgenus, the two enzymes differ at 
about a quarter of their residues. As with the EST6/ 
AChE comparisons, the sequence differences are 
also widespread across the predicted super-secon- 
dary structure. Salt and cysteine bridges are again 



264 



conserved but high levels of divergence occur in 
the a helices, the P strands in both p sheets, the 
substrate binding sites, the active site goige and 
regions for which no particular higher order struc- 
ture is identified. 

Unlike the EST6/AChE and EST6/EST5 com- 
parisons, comparisons of EST6 between D. melan- 
ogaster and its two sibling species D. simulans and 
D. mauritiana involve a high level of functional 
conservation. Although sequence divergence is 
again high compared to other enzymes, the loca- 
tions of the differences in the super-secondary 
structure are now more restricted. The relatively 
conserved elements now include the active site 
gorge and the smaUer of the two 3 sheets, as well as 
the salt and cysteine bridges. Variation is concen- 
trated in a helices, the major 3 sheet, regions with 
no identified higher order structure and, perhaps 
surprisingly, the substrate binding sites. 

As in the interspecific comparisons, levels of 
EST6 polymorphism within D, melanogaster and 
D. simulans are very high compared to other en- 
zymes. However, the location of the variation is 
even more restricted than in the comparisons be- 
tween the sibling species. Variation is now essen- 
tially confined to the substrate binding sites and the 
regions for which no particular higher order struc- 
ture is identified. It is possible that many of the 
polymorphisms in the latter regions may be neutral 
to function and fitness. Howe vet, kinetic studies on 
purified preparations of A melanogaster allozymes 
differing in their substrate binding sites reveal dif- 
ferences in substrate interactions and specificities. 
Furthermore demographic data on these aUozymes 
reveal non-random spatial and temporal distribu- 
tions indicative of natural selection. 

Analyses of the respective promoters are less 
advanced than the structural comparisons of EST6 
within and among the various species above. 
Nevertheless some clear parallels in the pattern of 
conservation and change emerge. Thus the promot- 
ers of Est6 in D. melanogaster and ft pseudoob- 
scura show limited sequence similarity and some 
fundamental differences in organisation. High lev- 
els of sequence variation are also evident in com- 
parisons of the Est6 promoters in D, melanogaster^ 
2). simulans and D. mauritiana but their overall 
organisation is conserved and much of the first 300 
bp of 5' sequence is absolutely invariant. The same 
holds true of the promoter polymorphisms within 



£>. melanogaster and D. simulans. As with the 
structural polymorphisms, some of the promoter 
polymorphisms may be irrelevant to function and 
fitness but particular polymorphisms lie in ele- 
ments which early functional data indicate are re- 
quired for specific pulses of expression and demo- 
graphic data suggest are under selection. 
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Abstract 

6 (£sr.6/EST6) is the major P-carboxylesteiase in D, melanogaster and its siblings Z>, sinmlans and 
^f^^'^T'} Ai^'^^o!^ "^^^"^ ^"""^ '""J^ ^''^ of expression is the sperm ejaculatory duct 
andacuvitypolj^oiphi^^ 

w ? ^ fJf^ ^' melanogaster sequences. As might be expected, replacement sites are sig^ 
St6 iST^'T^^^ '^""^ '^"^'^'^ ^ ^ comparisons, suggesting that selection is acting to maS 
^It fonction among the three species. Nevertheless, the ratio of the levels of replacement to 

silent site divergence is stiU much higher for Est^6 than for seven of ten other genes (inclu^TboS 
S;?^Tff comparable data have been published for these species. This is co4tS 

iS^h^f Jfl EST6 electrophoretic polymorphism within Z>. melanogLr and D. simu^^l 
imphes that selective constramts against amino acid change are relatively weak for EST6 By contrast 
a)mpamonsmyolvmgprom^^^^ 

for these sp^ies. In particular, there are two perfectly conserved stretches (-1 to -158bp and -219 to 
-334bp)each^^^^ 

sdective con^t on the ammo acid seq^^^ 

affecting aspects of its expression. ^ 



Introduction 

Interspecific nucleotide sequence comparisons are 
now widely used to identify regions of conservation 
or change in the coding sequences of genes (Le- 
wontin, 1989). Only recently, however, have inter- 
specific comparisons beeri used to assess conserva- 
tion or change in flanking sequences, and in partic- 
ular, promoter sequences. Comparisons of diver- 
gence rates for such sequences with those for syn- 



onymous and codmg changes hi the corresponding 
structural sequences hold great potential for ad- 
dressing issues concemmg the selective constraints 
on such regions, the neutral rate of evolutionary 
change and the relative importance of regulatory 
change in evolution. Some mterspecific com- 
parisons of 5' flanking regions have already identi- 
fied conserved elements in promoter regions (e.g. 
Kassis et aL, 1989; Jones et al, 1991) and, m a few 
cases, have lead to the identification of specific 
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differences in promoter sequences responsible for 
interspecific variation in gene expression (Bray & 
Hirsh, 1986; Brennan etaL, 1988). 

The esterase 6 gene-enzyme system of D. melan- 
ogaster and its sibling species D. simulans and D. 
mauritiana is becoming an informative model sys- 
tem for studying both structural and regulatory evo- 
lution. Considering first its structure, EST6 is a 
monomeric glycoprotein of about 60 kiloDaltons in 
all three species (Morton & Singh, 1985). The en- 
zyme is electrophoretically polymorphic in the two 
cosmopolitan species D. melanogaster and D. sim- 
ulansy and parallel latitudinal clines are found in the 
firequencies of the common electrophoretically 
slow allozyme in both species (Anderson & Oake- 
shott, 1984). The shared clines suggest both that 
this variant is subject to selection and that the 
molecular basis for the selection is shared in the 
two species. 

The regulation of EST6 in the$e spocies is simi- 
lar in that the major pulse of expression in all three 
species is in the anterior ejaculatory duct of adult 
males (Stein et aL, 1984; Morton & Singh, 1985). 
The enzyme is transferred to females during mating 
(Richmond & Senior, 1981; J. K. & J. G. O., un- 
pubL data) and affects the females' subsequent re- 
productive behavior (Scott, 1986). Various lines of 
evidmice from Z). melanogaster suggest that the 
level of EST6 expressed in the ejaculatory duct 
affects reproductive fitness (Gilbert et aL, 1981) 
and one restriction fragment length polymorphism 
about 500bp 5' of the gene is associated with popu- 
lation variation in male EST6 activity levels (Game 
& Oakeshott, 1990). Z). melanogaster^ D, simulans 
and D. mauritiana are the only species in the 
melanogaster subgroup in which male EST6 activ- 
ity is significantiy higher than female activity 
(Richmond et ioL, 1990), although in geneM D. 
simulans males have higher EST6 activity than Z>. 
melanogaster and i>. mauritiana males (L K. & J. 
G. O., unpubl. data). 

The evidence on bodi the structural and regula- 
tory features of esterase 6 thus suggests that the 
enzyme may have a specific function in reproduc- 
tion which is subject to selection within species and 
conserved across D. melanogaster^ D, simulans and 
D. mauritiana. As part of an ongoing investigation 
into the molecular bases of such selection and con- 
straint, the present study examines patterns of di- 
vergence in both the structural and flanking se- 



quences of the Est'6 locus among the three species. 
The D. melanogaster gene has been cloned and 
characterized previously (Oakeshott et oL^ 1987; 
Collet et aLy 1990) and die current paper reports the 
cloning and sequencing of the D. simulans and Z). 
mauritiana genes. The comparison of these three 
sequences herein reveals marked heterogeneity in 
the levels of divergence along the sequenced re- 
gion. In particular, while the Est-6 coding region 
proves to be among the most variable of all en- 
zyme-coding genes so far compared in these spe- 
cies, the first 350bp of 5' flanking sequences prove 
to be among the least variable. 

Materials and methods 

Libraries of partial SauSA digests of genomic DNA 
from wild type D. simulans and Z). mauritiana in 
XEMBL4A (gift of Dr. T. Kaufinan) were screened 
with a nick-translated probe (Sambrook et aL^ 
1989) prepared ftom the Est'6 cDNA clone fix>m D. 
melanogaster (Oakeshott et al, 19%!), One hybr- 
idising clone for each species was chosen for fur- 
theranalysis. DNA prepared from these clones was 
digested with restriction enzymes under conditions 
specified by the enzyme manufacturers. Southern 
blot hybridizations of these digests (Reed & Maim, 
1985) were used to construct restriction maps of the 
clones and to localize the £$1-6 genes within these 
maps. Several overlapping fragments spaiming the 
Est-6 gene and flanking regions were cloned into 
plasmid vectors pTZlSU and pTZ19U (BioRad) 
using standard procedures (Sambrook et aZ, 1989). 
Single stranded plasmid DNA was prepared by the 
method of "Weiia & Messing (1987). Botii strands 
of the £sfr6 genes and flanking regions were se- 
quenced by the dideoxy chain termination method 
(Sanger et aL^ 1977). Sequences were analyzed us- 
ing version 6.0 of the University of Wisconsin Se- 
quence Analysis Software Package (Devereux et 
aL, 1984). 



Results 

Overall structural and sequence comparisons 

A 2.85kb region encompassing the Est'6 gene has 
been sequenced for both D. simulans and D. mauri- 
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Fig, L Comparisoii of the nucleotide sequences in the Est-6 regions of the tfiree spcdes, with the complete D. melanogaster (ineD 
sequence given as a reference and differences in the X), simulans (sun) and D. mauritiam (mau) sequences shown below. inserted 
in any of the three sequences to in^)rove the alignment are represented by dots. All numbering referred to in the text is that of the 
reference sequence, with the start of translation as +1. The amino add sequence mferred for A mekmogaster is also inchided, and is 
numbered wifli the first residue of the mature peptide as 1. Note that resequendng of the 5 ' region of the i>. melanogaster clone Dml45 
of Collet et aL^ (1990) m this laboratory indicates two changes to fts previously published sequence. We find a T at position -107 
(conespondingtoanAat 139 in the published sequence of Collet cr al, 1990) and a CSG at -142 to -143 (corresponding to a single 
0 at 103). 
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tiam (Genbank accession numbers L 10670 (D. 
simulans\ L1067I (Z). mauritiam)). The region 
includes l.lSkbof 5' flanking sequence, the 1.68kb 
coding region and 0.14kb of 3' untranslated se- 
quence. These sequences are aligned with the corre- 
sponding nucleotide sequence £>. melanogas- 
ter (Collet et at, 1990 and K. M. Nielsen, C, Collet 
& R. C. Richmond, pers. conun., Genbank acces- 
sion numbers M33780, M33781) in Figure 1 and 
the inferred amino acid sequences are aligned in 
Figure 2. 

The overall structure of the Est-6 gene and flank- 
ing regions is very similar among the three species. 
In D. melanogaster the coding region consists of 
two exons separated by a small intron (Collet et al^ 
1990). The locations of initiation and termination 
codons and intron splice consensus sequences indi- 
cate that this structure is conserved in both D. simu- 
lans and D, mauritiana^ although this has not been 
confirmed by cDNA or nuclease protection analy- 
sis in the latter two species. Exon I is 138 Ibp in 
length in Z>. simulans and Z), mauritiana and 
1387bp in D. melanogaster^ exon 2 is 248bp long in 
all three species and the intron is 50bp in D, simu- 
lans and D, mauritiana and 5 Ibp in D, melanogas- 
ter. The inferred amino acid sequences are 544 
residues long in D. melanogaster and 542 residues 
long in D. simulans and D. mauritiana. 

Analysis of promoter sequences is confined to 
the first l.lkb 5' of these genes because germ line 
transformation of D. melanogaster with the Est-6 
gene horn each of the three species indicates that 
this is sufficient for most aspects of wild type Est-6 
expression (J. K., M. J. Healy, M. M. Dumancic & 



J. G. O., unpubl. data). Features of this region 
shared by the three sequences include a non-cpn- 
sensus TATA box (AATAAAA, -68 to -62bp), a 
potential GC box (GGCCGGG, ^96 to -90bp), a 
potential CCAAT box (TCAAT, at -128 to 
-124bp) and seven contiguous nucleotides be- 
tween -41 and — 35bp shown by primer extension 
analysis to be the transcription initiation region in 
D. melanogaster (Collet et aL, 1990). The 3' un- 
translated region in D. melanogaster contains two 
consensus polyadenylation sequences (AATAAA, 
Wickens & Stephenson, 1984), starting 9bp and 
141bp 3' of the stop codon, which produce two 
alternate transcripts in that species (Collet et al, 
1990). Consensus polyadenylation sequences are 
present in similar positions in the i>. simulans and 
Z>. mauritiana seqiiences and northern analysis of 
adult RNA from both species suggests that they 
each produce two alternate transcripts of simil^ 
size to those of Z>. melanogaster (J. K. & J. G. O., 
unpubl, data). The second of the polyadenylation 
signals coincides with the presumed TATA box of 
the esterase P (Est-F) gene, a tandem duplication of 
Est-6 in fliese species (Collet et aL, 1990; J. K. & J. 
G. O., unpubl. data). The coding regions of Est-6 
and Est-P are only 0.20kb apart in all three species. 

Table 1 sununarizes percentage nucleotide simi- 
laritieis in both coding and non-coding regions 
among the three species. For the purpose of these 
comparisons, the Est-6 structural sequences have 
been divided into exon sil^t sites, exon replace- 
ment sites and intron sites, while the 5 ' untranslated 
and y untranslated sequences of the Est-6 tran- 
scriptional unit are treated as separate segments. 



TRible L Percentages (± binomial standard etrois) of nucleotide diffeiences in various regions of Est-6 (ignoring insertions/deletions) 
among D. melanogaster (mel), A simulans <sim) and A mauritiana (mau). n Is the number of sites compared; UT, untranslated. 
All figures are corrected for multiple substitutions by die Jukes-Cantor method (Gojobori et oL , 1990). The fourth row (mel : + mau) 
compares the D. melanogaster sequence with the consensus of the other two. 
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Fig, 5. ISstogtams showing the distributioa of nucleotide differences among the sequenced regions of £51-6 in Z>. melanogaster (mel), 
Z). swudans (sim) and D. mauritiana (mau). The three sequences were aligned simultaneously, then divided into six regions 
(5' umranscribed, 5' untranslated, exon I» intion, exon n and 3' untranslated sites). Each region was scored for nucleotide substitutions 
per 50bp interval; * where an interval contains less than 50bp, the number of sites compared is shown above the bar; R, replacement 
site differences; S, non-coding differences. The top three plots show tiie three pEorwise comparisons and the fourth compares 
D. melanogaster with the consensus of tiie other two. 
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The 1050bp of untranscribed 5' flanking sequence 
which can be aligned contiguously (i.e.' excluding 
all insertion/deletions) have been divided into 
three equal segments: a proximal segment (-42 to 
-391bp), a central segment (-392 to -741bp) and 
a distal segment (-742 to - 1091bp). Comparisons 
of the entire aligned sequences indicate, as is com- 
monly accepted, that D. simulans and D, mauri- 
tiana are more closely related to each other than 
either is to D. melanogaster(LaLchEiise€taL, 1988), 
although the distribution of nucleotide differences 
along the sequenced region is higly non-random 
(Fig. 3). 

Within the structural gene, the levels of exon 
silent site divergence across the three q)ecies are 
not significantly different to those occurring in in- 
tron sites (G = 0,17, df = 1, P > 0.05), Changes at 
silent sites and intron sites are unconstrained by 
selection to maintain protein structure or function, 
and to this extent their rates of divergence should 
both approximate neutral expectations (Kimura, 
1991). The levels of divergence in EstS replace- 
ment sites are, as expected, significantly less (5.2 
fold on average) than those m silent and intron sites 
(G - 77.61, df « 1, P < 0.001), implying that 
replacement site change is indeed constrained by 
selection to maintain EST6 structure and function. 
Replacement site variation does not differ signifi- 
cantiy between the two exons for any comparison 
(G < 1.02, df=l,P> 0.05). 

The 3' untranslated sequences within the tran- 
scriptional unit show a similar level of divergence 
to the exon silent sites and the intron, but 5' un- 
translated sites are invariate in all comparisons. 
Likewise the level of divergence among all three 
species in the proximal segment of the 5' untran- 
scribed region is significantly less (sk fold on aver- 
age) than that in exon silent, intron and 3' untrans- 
lated sites (G » 44.56, df « 1^ P < 0.001). In the 
central 5 ' untranscribed segment, the level of diver- 
gence is lower than the silent site level for the 
comparison between D. simulans and D. mauri- 
tiana (G = 7.81, df = 1, P < 0.05) but approximates 
the exon silent site levels in comparisons involving 
Z). melanogaster (G = 0. 19, df = 1, P > 0.05). In the 
distal segment, however, the level of divergence 
among all three species approximates the corre- 
sponding exon silent site level (G « 0.04, df = 1, P 
> 0,05). Ihus, outside die coding region, reduced 
divergence indicative of selective constraint is 



largely confined to the 5' untranslated region and 
die adjacent proximal segment (-41bp to -391bp) 
of untranscribed DNA. 

There is also a tendency for more numerous and 
larger insertions/deletions in the central and distal 
segments (six and five respectively, of up to lObp 
in length, compared to a single 3bp deletion in the 
proximal segment; Fig, 1). Overall, however, inser- 
tions and deletions tend to be compensatory (for 
example, D. simulans and D. mauritiana share both 
a deletion of lObp between -656 and -647 and a 
lObp insertion between -604 and -603 in the 
reference Z), melanogaster sequence), so the spac- 
ing of aligned sequences remains relatively con- 
stant. 

Amino acid differences 

Analysis of the protein coding sequences reveals a 
total of 31 amino acid diffeitences among the three 
species, including 30 amino acid substitutions and 
one insertion/deletion (Fig. 2). There is no obvious 
clustering of the substitutions along die EST6 pri- 
mary sequence, aldiough it is noteworthy that the 
one insertiori/deletion occurs in thesignal peptide, 
a region which shows a high level of interspecific 
divergence across diverse proteins. Merred fea- 
tures of the mature protein which are absolutely 
conserved among the three species include six cys- 
teine residues involved in disulfide bridges (Cooke 
& Oakeshott, 1989) and three non-contiguous resi- 
dues, Ser-188, a basic Glu-318 and an acidic His- 
445, that may be involved in a charge relay which 
donates a proton to an ester bond during ester hy- 
drolysis (Sussman et aL, 1991; Schrag etaL, 1991 
and references therein). Two alternative but now 
less likely candidates for the basic and acidic com- 
ponents of the charge relay (Asp-160 and His-408, 
Myers et aL, 1988), are also conserved among the 
species. There are four recognition sequences for 
N-linked glycosylation in the primary sequence of 
jD. melanogaster EST6, all of which are known to 
be glycosylated in diat species (Myers, 1990). Only 
three of these are conserved in the other two spe- 
cies. The fourth has been disrupted by a Ser/Val 
substitution at residue 487 which is common to 
both D, simulans and Z>. mauritiana. The latter two 
species also share a difference from D, melanogas- 
ter m the presumptive signal peptide. Two adjacent 
residues (Gly and Leu at residues -15 and -16) 
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in the hydrophobic core of the signal peptide in 
D, melanogaster are not found in D, simulcms and 
D, mauritiam. 

Figure 4 classifies each of the 30 amino acid 
substitutions as conservative or nonconservative 
for charge, molecular volume, polarity and hydro- 
phobicity. Seven of these differences are conserva- 
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Fig. 4. Comparisons of physicochemical properties of the 30 amino add difierences among the three species. The residues in die 
D, melanogaster sequence are given first, followed by die substituted reddue; dots in the D. simulans (sim) and D, mauritiam (mau) 
columns indicate identity with D, melanogaster, £Uied boxes indicating differences. Conservative (Q and nonconservative (NQ 
di£6erences are as defined by Taylor (1986). Hydropathy predictions are based on the algorithm of Kyte and Doblittle (1982): 
SUR, piotciri surfece; INT, interior, ?, mtermediate hydropathy (see Kg. 5). 



tive for all four physicochemical properties, 18 are 
conservative for three properties, while only four 
differences are nonconservative for two properties 
and one is nonconservative for three. In all, 83% of 
amino acid dijfferences are conservative for at least 
three physicochemical properties. It follows that 
there are no major differences betwe«i the hy- 
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diopathy plots for EST6 in these three species (data 
not shown) and in Figure 5 the amino acid substitu- 
tions are superimposed on a hydropathy plot of the 
D. melanogaster sequence (Kyte & Doolitde, 
1982). Of the 30 differences, sixteen are in regions 
predicted to be hydrophilic and likely to be on the 
surface of the mature protein. Thirteen of the differ- 
ences are in regions of intermediate hydropathy and 
only one is in a region predicted to be hycbophobic. 
All five differences which are each nonconserva- 
tive for two or more properties are in regions pre- 
dicted to be hydrophilic, so on the surface of the 
protein where they are less likely to affect its secon- 
dary structure. 

Previous studies have classified the common 
electrophoretically slow (EST6-S) allozymes of D. 
simulans and D. melanogaster as having identical 
mobilities (e.g. Hyyata et oL, 1985). Howev^, our 
preliminary surveys of Aiistralian and American 
populations of D. simulans (methods of Cooke et 
aLy 1987) indicate that the most common EST6-S 
allozyme has slightly less (about 2%) anodal mo- 
bility than that of the common EST6-S form of D. 
melanogaster (J. K. & J. G. O,, unpubl data). The 
gatxes sequenced herein encode a representative of 
the most common EST6-S allozyme of D. simulans 
and a fast (EST6-F) allozyme in D. mauritiana (J, 
K. & J. G. O., unpubl. data) while the D. melan- 
ogaster sequence (Collet et aL, 1990) encodes a 
representative of the most common EST6-S al- 
lozyme in natural populations (Labate et aL, 1989). 
None of the 23 amino acid differences between the 
Z), melanogaster and Z), simulans sequences result 
in a major net charge difference, although one or 
more may result in minor conformational changes 
responsible for the minor rnobility difference be- 
tween the two. The D. mauritiana sequence has a 
net negative charge with respect to the other two, 
due to a negatively charged Glu at residue 180 of 
the D. mauritiana sequence where both D. melan- 
ogaster and Z). simulans have an uncharged Gin 
residue. This more negative charge of the Z). mauri- 
tiana sequence is consistent witibi its more anodal 
electrophoretic mobility. 

Promoter variation 

As explained earlier, there is a highly non-random 
spatial distribution of nucleotide substitutions in 
the sequences 5 ' of the Est-6 initiation codon. Low 
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Fig. tf. Direct repeats and palindromes in Ae 5' flanking se- 
quences Est-6 in Z). melanogaster (mel), D. simulans (sim) and 
A mauritiana (man). Repeated or palindrome elements are un- 
derlined, colons signify inteispedfic nucleotide identities and 
lower case letters indicate intraspecific mismatches within the 
repeated or palindromic modfe. Numbers in parentheses indicate 



\^*ose elements are at least 8bp in length (allowing no mis- 
matches) and less dian 50bp apart are presented 
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levels of divergence suggestive of shared func- 
tional constraints are confined to the 5 ' untranslated 
region of the gene, the adjacent proximal segment 
of untranscribed DNA and, at least in D. simulans 
and D, mauritiana, to the next, central segment of 
5' sequence. In fact, the perfect conservation ob- 
served over the 41bp of 5' xmtranslated sequence 
continues into the proximal 5' untranscribed seg- 
ment, which contains two perfectiy conserved re- 
gions, each over lOObp long (-42bp to -I58bp 
and -219bp to -334bp in the reference D. melan- 
ogaster sequence). One of the latter contains a per- 
fect 8bp direct repeat (-120 to -liSbpand -112 
to -105bp) (Fig, 6) that may have a specific pro- 
moter function. 

The central segment of 5' sequence contains 
three different palindromic sequences, all in the 
region between -520 and -620bp, but none of 
these are perfectly conserved in all three species 
(Fig. 6). The first is a 16bp perfect palindrome 
(-536 to — 521bp) in the D. melanogaster se- 
quence which is in a similar position in the other 
two species, and is still largely, intact (6/7 matches 
in the central 14 bases). The second, a 16bp perfect 
palindrome (-619 to -606bp in the D. simulans 
sequence), shows a single mismatch in D. mauri- 
tiana but is absent in D. melanogaster, due to a 
lObp deletion at -604bp. The third is a 16bp per- 
fect palindrome in D, simulans and D. mauritiana 
(between -602 to -595bp and -569 to -562bp m 
the D. simulans sequence) that is absent in D. 
melanogaster due to several mismatches. 

The distal segment of 5' sequence in D. melan- 
ogaster contains an 8bp direct repeat (-929 to 
-922bp and -889 to -882bp) fliat is not con- 
served in D. simulans or D. mauritiana (Hg. 6). 
The repeat includes tiie motif (ACACAA at -927 
to -922bp and -887 to -882bp) which is unique 
to die 5' untranscribed region and is present a fiir- 
ther three times in the Z). melanogaster sequence (at 
-1098 to -1092bp, -939 to -934bpand -571 to 
— 566bp). Only one of these five repeats (at - 1098 
to — 1092bp) is present in the D, simulans and D. 
mauritiana sequences. Two different 16bp perfect 
palindromes, one m D. melanogaster (-860 to 
-853 and -816 to -809bp) and the other in Z>. 
mauritiana (-966 to -959bp and -912 to 
— 905bp), exhibit one or more CMsmatches in the 
other two sequences. 



Discussion 

Alipunent of the nucleotide sequences of the Est-^ 
region of D. melanogaster, D. simulans and Z). 
mauritiana, reveals very similar overall gene struc- 
tures but highly variable levels of conservation at 
die nucleotide sequence level. Conservation is low- 
est across the exon silent sites, the intron and the 3 ' 
untranslated region of the transcriptional unit and 
the most distal segment (-742 to - 109 Ibp) of the 
5 ' untranscribed region. It is generally assumed that 
the levels of divergence in non-coding regions will 
approximate the neutral mutation rate (Kimura, 
1991), and consistent witii this assumption, the 
level of silent site divergence for Est-6, although 
high, falls within the range of values observed for 
silent sites of other genes that have been sequenced 
in bofli D. melanogaster and D, simulans (Table 2). 

Nevertheless the level of exon silent site diver- 
gence does differ significantiy among the genes 
tabulated; the level for Est'6 is similar to that for 
the majority of loci listed but all of these are at least 
twofold greater than for Hsc, Adh, Hsp82 and Mtn. 
The relatively high level of divergence for Est-6 is 
not associated wiA^its .percentage of effectively 
silent sites as this statistic is relatively constaiit 
across the eleven loci compared. Although the high 
Est-6 value is consistent with the proposition 
(Shields et al, 1988; Moriyama & Gojobori, 1992) 
that the rate of silent site change will be greatest for 
loci with a low codon bias, there are enough excep- 
tions to this proposition (e.g. Sod, sal and ci^ to 
argue against any causal connection for Est-6. Sim- 
ilarly, Buhner e/ai, (1991) found significant levels 
of variation in silent site substitution rates among 
58 mammalian genes which could not be attributed 
to variatiori in codon bias alone. 

Compared to the consistentiy high levels of di- 
vergence in the silent sites, intron sites, 3' untrans- 
lated and distal 5' sequences, the exon replacement 
sites, the 5' untranslated region of the transcrip- 
tioiud unit and the proximal (-42 to -391bp) arid 
central (-393 to -741bp) 5' untranscribed regions 
exhibit varying levels of constraint. This impUes 
that shared selection pressures are acting to main- 
tain the integrity of the latter regions across the 
three species. 

The level of divergence in Est-6 replacement 
sites is only one sixth to one ninth that in exon 
silent sites and the distribution and nature of the 30 
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ThbU 2. Comparison of percent nucleotide sequence divergence (corrected for multiple hits by the Jukes-Cantor method. CSojobori et 
aL, 1990) in different regions of twelve genes sequenced in both A melanogaster and D. simulans. 5' regiais are divided into flanking 
(untranscribed) sites, untranslated (UT) sites and intion sites. Coding regions are divided into exon silent sites and replacement sites The 
percentage of effectively silent sites (%ESS. Nd and Gojobori, 1986) and the codon Has (%G + C at ftiid positions of codons) fbr'each 
coding region are also included. Lod ate listed in order of decreasing R/S value (the ratio of the percentages of replacement and exon 
silent site divergence). "O'Neil and Belote (1992); •^Oreracefoi (1990) and Benyef oil (1991); <= Reuters (1989)- <» present study 
and Collet et oL (1990); Kreitman and Hudson (1991); « Glares and abrera (1987) and Maitin-Campos et al (1992); ^ Lange et 
al (1990); * Seto etaL (1987) and Kwiatowsld et al (1989); J Ingolia and Craig (1982); k Blackman and Mcselson (1986);' ' Martin et 
clL (1988). 
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inteispecific amino acid substitutions observed pio- 
. vides further evidence tiiat replacement sites are 
selectively constrained Firstly, hydropathy plots 
drawn from the three inferred amino acid sequences 
are almost identical, reflecting the fact that most of 
die interspecific amino add, differences are phys- 
icochemically conservative. Secondly, these 30 dif- 
ferences are not clustered in the primary sequence 
and are generally not found in regions Imown to be 
important to the structure or function of the mature 
EST6 protein. One of two possible exceptions in- 
volves the fourth recognition sequence for N-linfced 
glycosylation in the D. melanogaster sequence, 
which is disrupted in both D. simulans and D, mau- 
riUana by the Ser/Val difference at residue 487. 
However, this recognition sequence is polymorphic 
in a natural population of melanogaster and, 
even when present, is not always used (Cooke & 
Oakeshott, 1989; Myers, 1990); tiiese observations 
suggest glycosylation at this site is not critical to 



EST6 function. The second possible exception is 
the presumptive signal peptide of Z>. melanogaster^ 
which is two amino acid residues longer than those 
of Z). simulans and D. maunticma; however all 
tiiree sequences remam within the accepted size 
range and exhibit the hydrophobic core and hydro- 
philic ends characteristic . of signal peptides (von 
Heijne, 1984). 

While the number and nature of the replacement 
site differences thus indicate a level of constraint on 
amino acid divergence among the three species, the 
degree of constraint is in feet weak relative to other 
loci for which comparative data are available for 
tiiese species. As Table 2 shows, only three of ten 
other sequenced loci (tra, cffl and sal) show a level 
of replacement site divergence higher than that for 
Est'6 and, significandy, all tiiree are DNA binding 
proteins (Renter et aL, 1989; Qrenic et aL, 1990; 
O'Neil & Belote, 1992). Replacement site diver- 
gence in Est'6 is the highest of the three isozyme- 
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coding loci tabulated, which is constetent with the 
high levels of EST6 amino acid polymorphism de- 
tectable by both allozyme and DNA sequence sur- 
veys within Z). melanogaster and D. simulans (Al- 
buquerque & Napp, 1981; Cooke & Oakeshott, 
1989; Labate et aL, 1989; J. K., T. M. Boyce & J. 
G. O., unpubl. data). 

EST6 is electrophoretically polymorphic in botii 
the cosmopolitan species D, melanogaster (Cooke- 
& Oakeshott, 1989) and D. simulans (Albuquerque 
& Napp, 1981) but to date only two electrophoretic 
variants have been reported in the geographically 
restricted D. mauritiana (Gonzalez et aL^ 1982). 
Latitudinal clmes in the frequencies of the major 
EST6-F and EST6-S allozymes in D, melanogaster 
and D. simulans are parallel (Anderson & Oake- 
shott, 1984), suggesting that they may be subject to 
common selective forces and have a common mo- 
lecular basis. Only two amino acid polymorphisms 
separate the major EST6-F and EST6-S variants of 
D. melanogaster^ these are good candidates as the 
target for selection underlying the clines (Cooke & 
Oakeshott, 1989). One is charge-conservative (Ala 
(EST6-F) to Thr (EST6-S) at position 247), while 
the other (negatively-charged Asp (EST6-F) to m- 
charged Asn (EST6-S) at 237) is the only net 
charge difference between the two sequences, and 
so is likely to underlie the electrophoretic mobility 
difference. 

What then do the sequences presented herein tell 
us about the EST6-F/EST6-S difference in D. simu- 
Lmsl The D. simulans sequence presented here is of 
an EST6-S variant, of similar net charge but 
slightly slower mobility than the major EST6-S 
variant of A melanogaster (Collet et aL^ 1990). 
The fact that the sequences of these two variants 
share a Thr.at 247 and an uncharged Asn at 237 is 
consistent with the possibility that the molecular 
basis of the parallel clines is the same in the two 
species, although the sequence of an EST6-F vari- 
ant of D. simulans is needed to confirm this. 

Hie D. mauritiana sequence presented here is of 
an EST6-F variant of similar net charge but even 
fester mobility than the major EST6-F variant of D. 
melanogaster (Cooke & Oakeshott, 1989). How- 
ever, the D. mauritiana EST6-F sequence is similar 
to the EST6-S variants of the other two species in 
that it also has a Thr at 247 and an uncharged Asn 
at 237. The faster mobility of the D. mauritiana 
EST6-F is best explained by the presence of a nega- 



tively charged Glu at 180, which is the only net 
charge difference between the D. mauritiana 
EST6-F sequence and the Z). melanogaster and D. 
simulans EST6-S sequences (which both have an 
uncharged Ghi at 180). Thus the faster mobility 
state of EST6-F in D. mauritiana represents a dif- 
ferent amino acid replacement to that which distin- 
guishes EST6-F and EST6-S in D. melanogaster. 
This difference between the species has no implica- 
tions for the interpretation of the latitudinal clmes 
above, since Z>. mauritiana is not known to share 
the EST6-S mobility forms found in D. simulans 
and D, melanogaster^ and its restricted geographi- 
cal range leaves no scope for the development of 
large scale latitudinal clines anyway. 

Comparison of the nucleotide sequences of D, 
melanogaster Est'6 and its homolog in D, pseu- 
doobscura (EstSB) revealed 73.6% similarity be- 
tween their inferred amino acid sequences (Brady 
et aL^ 1990); the majority of the 139 amino acid 
differences are physicochemically conservative and 
there is no evidence of clustering of the nonconser- 
vative differences in the primary sequence, similar 
to the findings of the present study. This may sim- 
ply reflect a constraint to maintain the overall stmc- 
ture and esteratic function of the enzyme, consider- 
ing the two proteins are. expressed in different tis- 
sues (Bmdy & Richmond, 1990) and presumably 
have different physiological functions. 

Nevertheless, ihe E$t-6/Est5B comparison gives 
the highest values for both replacement and silent 
site divergence of any coding region sequenced in 
both the melanogaster and obscura groups (Brady 
et aLy 1990). Both genes exhibit similar and very 
low codon bias. Assuming then that silent site di- 
vergence is largely unconstcained in both lineages, 
the ratio of replacement site to silent site diver- 
gence (R/S) should give an indication of the rela- 
tive levels of constraint on replacement site change. 
R/S for D. melanogaster versus D. simulans (0.14 
± 0.04) is significantly less (Z = 2.11, P < 0.05) 
than that for D, melanogaster versus D. pseudoob- 
scura (0.23 ± 0.02), implying that there is more 
constraint on EST6 sequences within the melan- 
ogaster group than there is between the melanogas- 
ter and obscxua groups. This contrast may reflect 
the greater similarity in the tissue and developmen- 
tal profiles of the enzyme (Aronshtam & Kuzm, 
1974), and presumably therefore in its physiol- 
ogical function, between D. melanogaster and Z>. 
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simulans as compared to D, pseudoobscura. Ele- 
vated levels of ejaculatory duct expression is a fea- 
ture of EST6 expression which is unique to the 
melanogaster complex and would suggest a qualita- 
tively different function ftom D. pseudoobscura 
(Richmond etaL, 1990). 

Comparison of the sequences 5' of the Est'6 
gene reveals several conserved regions which may 
be important in the regulation of EST6 expression. 
The proximal segment (-42 to -391bp) displays a 
low level of divergence that is clearly much less 
than the corresponding levels for exon silent sites 
and indeed is not significantly different from the 
levels for replacement sites, making it one of the 
most conserved elements of the 2.83kb of sequence 
presented here. CJonsistent with this finding, se- 
quence comparisons of Est-6 in D. melanogaster 
with EstSB in D. pseudoobscura show that the most 
conserved 5' region is the 174bp proximal to the 
translation start site in the reference D. melanogas- 
ter sequence (Brady et aU 1990). 

The selective constraints operating on the proxi- 
mal 5' sequence which we infer from these compar- 
isons are bome out by functional data. Germ line 
transformation experiments with D. melanogaster 
Est'6 show that at least the first 150bp of 5' se- 
quence axe required for basal levels of expression 
throughout development (M. J. Healy, M. M. 
Dumandc & J. G. O., unpubl. data). Precisely 
which elements within this region are required for 
expression await more detailed functional analysis. 
However, a direct repeat associated with one of the 
two perfectly conserved regions in the proximal 5' 
segment may have significance (Wingender, 1988), 
Evid^ce for this is die fact that a single element of 
this repeat (ACTGGTTG, repeated between -120 
and - 105bp in the D, melanogaster sequence) is in 
a similar position in the Z). pseudoobscura EstSB 5' 
sequence (Brady et aL, 1990) and is part of the 
longest stretch of perfect conservation among these 
four species (TGAG/4C2X?G7TC? at - 120 to -109 
in the A pseudoobscura sequence). 

The level of divergence in the central segment of 
5' sequence (-326 to -650bp) is again not signifi- 
cantly different from the corresponding value for 
exon replacement sites for D, simulans versus D. 
mauritiana (Table 1, Fig. 3). However, this is not 
true for comparisons involving Z>. melanogaster^ 
which show a level of divragence in this region 
which is not significantly different ftom that m 



silent sites. These observations imply either that 
there is less selective constramt on this region in D, 
melanogaster than in the other two species, or that 
the Z>. melanogaster sequences in this segment are 
evolving a different function. In view of the differ- 
ences among the three species in the degree of 
male-specific expression (Morton & Singh, 1985; 
J. K. & J. G. O., unpubl. data), it is noteworthy here 
that functional studies show that this central seg- 
nient is required for male-specific expression in D. 
melanogaster (M. J, Healy, M. M. Dumancic & J. 
. G. O., unpubl. data). 

One particular sequence within the central 5' 
segment, a 16bp palindrome \rtiich is conserved 
across all three species (Fig. 6), is a candidate for a 
shared regulatory sequence. This palindrome con- 
tains an Rsal site, a polymorphism for which is 
associated with variation in male EST6 activity in 
a natural population of Z). melanogaster (Game & 
Oakeshott, 1990). The palindrome is also present in 
the 5' flanldng sequence of EstSB in D, pseudoob- 
scura, (Brady et aL, 1990) and in a known regula- 
tory region of the Drosophila per gene (Jackson et 
aLy 1986), both of which also show a degree of 
specificity for male reproductive tissue (Giebul- 
towicz et aL, 1988; Oakeshott et aL, 1990). How- 
ever, the correlation with male specificity is not 
invariant the palindrome is lackmg from another 
gene, GW, which is also expressed in the male 
reproductive tract (Cavener, 1992 and references 
therein). 

Levels of nucleotide sequence divergence in the 
distal 5' segment approximate those in exon silent 
sites. This could indicate either an absence of pro- 
moter elements requiring conservation or perhaps 
the presence of different functional promoter ele- 
ments causing differences in expression among the 
three species. The latter explanation is consistent 
with the finding from the germ line transformation 
experimmts within Z). melanogaster that this re- 
gion is required for wild type levels of expression in 
a variety of tissues (M. J. Healy, M. M. Dumancic 
& J. G. O., unpubL data). EST6 activity levels are 
highly variable both within and among these spe- 
cies, (Game & Oakeshott, 1990; J. K, & L G. O., 
unpubl. data), although further interspecific analy- 
ses are needed to determine the contributions that 
various tissues ihake to the overall activities. It is 
interesting in this respect that an 8bp direct repeat 
in the distal segment of the Z). melanogaster se- 
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quetice is absent in the two other sequences; this 
repeat could be involved in an aspect of expression 
unique to D, melanogaster^ particularly since it in- 
cludes a motif (ACACAA), which is unique to the 
5' untranslated region and present five times in the 
D. melanogaster sequence, but only once m bodi 
the D, simulans and D. mauritiana sequences. 

Table 2 compares the levels of divergence in the 
5' untranslated regions of the Est-6 transcriptional 
unit and in the proximal segment of the untran- 
scribed regions with other genes for which appro- 
priate sequence data are available for both Z>. 
melanogaster and D, simulans. While the 5' un- 
translated region of Est-6 is perfectly conserved, in 
other genes it contains multiple substitutions and 
insertion/deletions. Two genes (Adh and Hsp82) 
have introns within the 5' untranslated sequence 
and these are mote variable again. Likewise the 
proximal 5' segment of untraniscribed DNA is at 
least as conserved m Est-6 as the equivalent length 
of 5 ' sequence for other genes. Taken together with 
the relatively weak constraint on the exon replace- 
ment sites for £sr-6, this suggests that stabilising 
selection on the esterase-6 gene-enzyme system is 
relatively weak with respect to its amino acid se- 
quence and relatively strong with respect to the 
regulation of its expression. We propose that this 
reflects the selective constraints on the sex, tissue 
and temporal specificity of its expression; the major 
pulse of expression in all three species occurring in 
the ejaculatory ducts of adult males. 
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Table I (Continued) 



Enzyme and Residues'* 

Active Center— Choline Binding Site . 
HA Y1137A 
MA Y337A,F 
TA E.wQ.D 

HA EanQ.D.A 

HA Wh^A 

HB Y446D 

Gorge Entry (Peripheral Anionic Site) 
HA D74 E,N.G.K 

MA D74N 
HB 0700*= 
DC Y,wD.O,K 

TA W^T^ 

HA W^iA 

MA W28^ 

MA WasjA 

MA YnN 

MA Y,24Q 

MA YtjN; Y,„Q 

MA YtjN; W,^^ 

MA Y,2,Q: W,,^R 

MA Y72N; Y.mQ; W,^, R.A 

MA Y72N; Y,24Q: W:a^,A; D74N 



Torpedo 
Equivalent 



Catalytic » Inhibitor Specificity 
and Structural Change 



Reference 



330 I substrate inhibition 1 30 

330 Change in inhibitor specificity (esp. A) 123 

i^jK^ i substrate inhibition (esp. D), change in 132, 53, 166 

inhibitor specificity, diminished aging rate 

1^ I J- substrate inhibition, change in inhibitor 130 
specificity 

S4 iW^mATCh. i prupidium affinity, i edrophonium 130.138 
affinity 

442 change In inhibitor specificity 135 

iflijquatemary. propidium, and dibucaine inhibition: 129, 130 
i Substrate inhibition 

•72 1f(r.lK^. 123 

72 Succinylcholine and dibucaine inhibition 125. 140 

72 G t preference BTCh 142 

K lower substrate affinity 

279 i Propidium and Wjquaiemary inhibition 78 

279 I Propidium and 6/jquateniary inhibition 130 

279 i Propidium and Wjquatemaiy inhibition 123 

279 i Propidium and Wjquatemaiy inhibition 123 

70 1 Propidium and Wiquatemary inhibition 123 

121 i Propidium and Wjquatemary inhibition 123 

70. 121 i Propidium and Wjquatemary inhibition 123 

70, 272 i Propidium and Wjquatemary inhibition 123 

121. 279 I Propidium and Wjquatemary inhibirion 1 23 

70, 121, 279 i Propidium and A/^quaiemary inhibition 123 

see above [Propidium and ^/jquateniary inhibition 123 



Other Catalytic and Structural Functions 

HB E44,G.E443G 

HA Y„4A 

HB Fas.Y 

HB 5425?** 

HB G3wV« 

HA H322NP 

HA P^e.R^ 

HA F338A 

MA F338G 

HA Y34, 

DC FhsS-^ 
DC I,„V- 
DC G303A'' 

jntersubunit Association 

"TA C537. truncation 

HA C580A 

DC C6I5, truncation 

Glycosylation 

HA N265Q 

HA N350Q 

HAN^Q 

HA N2<s5Q.N35oQ 

HA N35qQ,N464Q 

. HA N763Q,N35oQ.N464Q 



443. 445 

U6 

563 

427 

392 

315 

541 

331 

331 

334 

78 

129 

227 



537 
572 
537 

258 
343 
457 

258, 393 
258. 457 
343. 457 
258, 343. 457 



Ctecreased BTCh catalysis and dibucaine inhibition 
Restores function to D70 mutants 
Restores function to I>7o mutants 
Associated with D70 resistance 
i Succinylcholine. dibucaine and tacrine inhibition 
YT blood group antigen 

Allelic variation in glycophospholipid signal sequence 
Associates with Fj^^ 
Associates with F^,.,;. \ K^, 
1 Substrate inhibition 
increased organophosphate resistance 
Increased organophosphate resistance 
Increased organophosphate resistance 

Secreted 

Secreted monomer 
Secreted 



Diminished secretion 
Diminished secretion 
Diminished secretion 
Greater diminution of secretion 
Greater diminution of secretion 
Greater diminution of secretion 
Greater diminuiion of secretion 



140 
140 
140 

125. 140 
163 
161 
161 

138. 130 

123 

138 

164 

164 

164 



126. 165 
128 

145. 146. 151 

150 
150 
150 
150 
150 
150 
150 



Table 1 (Continued) 



Enzyme and Residues** 



Chimerae 



TA Exonv4 deletion, exon 3-5 linkage 
HB Linkage of mutant and non-mutant 
enzymes 

MA Substituted N-tcrminal and/or 
C-temiinal sequences with BuChE 
sequence 

HE Substituted AOiE sequence for 
BuChE 



Torpedo 
Equivalent 



various 



^l-l7*A,75-575 
B|_|74Ai73_487- 

E41ia-575 
E|^Ajft_|33- 



Catalytic. Inhibitor Specificity 
and Structural Change 



Glycophospholipid-linked inactive enzyme 
Augments or diminishes influence of the mutant 

B,.,74 confers BW specificity of BuChE 
Imparts partial AChB character 



Reference 



126 

125. 140 
131 



141 



'MA « Mouse acctyJchoIiDesteiase: HA » human acetylcholinesterase* TA « T/»«»»^o ..^^ui^v ' 7Z T " 

cbolinesterase a«xyicnwinesterase, TA « Torpedo acetykhoJmcsierasc: HB = human butyrylcboUnesterase. DC = Drosophila 



Phe290 




BUTYRYLCHOLINE 



Arg 289 288 




His 440 




Trpa4 



Arg 289 Leu 288 



Figure 4 Structure of bound buiyrylchoUne within the substrate bindins site of the F Mnrcnt a... . k 
Energy mtmmtzation was done with the Biosym fnsight a program S) Acetylcholmesterase. 
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